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Safety first 


Itis worrying that US government departments are unable to divulge basic data on research projects 


involving human subjects. Such data should be publicly available to ensure volunteers’ safety. 


stadium, travel on an aircraft or even head to the cinema 

these days without being told that someone, somewhere, 
is watching out for you. So why do some systems that are set up to 
protect the volunteers who participate in scientific research seem so 
inadequate? 

It's not as if we haven't been warned about what can happen when 
supervision and scrutiny are lax. In 2010, the US Presidential Com- 
mission for the Study of Bioethical Issues was tasked with a sobering 
mission. A series of horrifying medical experiments on Guatemalan 
citizens — some intentionally infected with syphilis — in the 1940s 
had recently come to light. President Barack Obama asked the com- 
mission to determine whether such an atrocity could still happen 
today, and to evaluate the protections in place for all who participate 
in human-subject research funded by the US government. 

The commission soon ran into a problem: a portrait of the current 
system was difficult to paint. Some government departments did not 
have ready access to essential data for identifying and tallying federally 
funded projects involving human subjects. More than six months after 
the commission asked for them, some departments were still unable 
to provide basic information, such as a list of all such projects, the 
number of participants involved and the location of the work. 

When it reported its findings in 2011, the commission concluded 
that current regulations probably protect research participants from 
unethical treatment. But it could not say so with certainty: “Because of 
the currently limited ability of some governmental agencies to identify 
basic information about all of their human subjects research, the Com- 
mission cannot say that all federally funded research provides optimal 
protections,” the report concluded. 

To improve the situation, and to help to secure the protection of all 
involved, the commission made a simple request. Any federal depart- 
ment or agency that supports human-subject research should make a 
core set of data publicly available, listing the research title, investigator, 
location and funding. The Department of Defense quickly complied, 
but some agencies still have not. That is not good enough. Although 
the bioethics commission cannot compel agencies to collate and gather 
this information, if these agencies are to preserve public trust and 
ensure future research, then they should all do so. 

The system that oversees human-subject research in the United 
States is already secretive at too many levels. For example, the ethics 
committees that assess risk and approve projects — institutional 
review boards — deliberate in private. Although there are guidelines 
on the types of expert who should sit on the boards, the guidelines are 
toothless because there is no independent system to check that they 
are followed. In 2009, investigators from the Government Account- 
ability Office reported that they had been able to register a bogus 
ethics committee with the Department of Health and Human 
Services. 


cc : our safety is our priority.” It’s difficult to visit a sports 


It is true that many federally funded medical trials are logged on the 
ClinicalTrials.gov website. But sponsors are not forced to register early, 
phase I studies, and the database extends to non-medical projects. 

Officials and government agencies are keen to talk up the benefits of 
gathering big data. Well, now they need to cough up some information 

themselves. And there is more to providing 


“The system these details than mere box-ticking. A reli- 
that oversees able map of where research is taking place 
US research on and what projects are under way allows 
human subjects researchers — and members of the public — 
is secretive to identify gaps and redundancies. It could 
at too many flag up populations of research participants 
levels.” who are being under- or over-sampled, and 


studies that are being conducted on popula- 
tions that may not receive the benefit of the results. 

Despite the struggle to obtain useful data, the bioethics commis- 
sion’s report was able to estimate that the government funded more 
than 55,000 projects involving human subjects in fiscal year 2010. 
Most of those were medical studies. 

It's time for the system to give a little back to the many thousands 
of volunteers who help researchers to advance these studies, some- 
times at risk to themselves — by lifting the veil of secrecy that limits 
oversight of such risks. Making the effort to fulfil the commission's 
recommendation is a good way for those in charge to start. m 


Generation game 


A Nature specialissue takes on the world of 
tomorrow — and the decisions shaping it today. 


wouldn't start from here. Perhaps the greatest trick that the 

film director George Lucas ever pulled was to set his Star Wars 
series not in the future, but a long time ago. Lucas’s emblematic take 
on once-upon-a-time introduced each film as entirely unconnected 
in space and time to the present day. Everybody on screen was long 
dead. Their lives and troubles and loves and hates were dust. The tales 
of heroism and noble deeds were essentially myths. 

Much science fiction does the opposite. It takes what we have 
now and spins it forward. Or it picks a destination and charts a 
course. Occasionally, the two narrative devices collide awkwardly, 
and present-day humans discover some futuristic technology, which 
they use to change their own path. But most of the time, even tales 
of aliens and interplanetary travel are presented as a consequence 


H= do we get to the future? As the old joke goes: well, I 
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ofa plausible series of likely events. 

Technologists will tell you that the future is already here, it is 
just unevenly distributed. But there is one factor that defies such 
a simplistic vision: humans. One day, in the not too distant future, 
everybody alive today will be dead. The planet will be inherited by 
people who had zero input into how Earth — their only home — was 
farmed, fished, burned, polluted, shaped and exhausted. Perhaps 
some of them are reading this. 

If so, the people of the future — those born in the late twenty-first 
century and beyond — may well scan this special issue of Nature with 
bewilderment or mocking nostalgia. In a series of articles starting 
on page 397, we tackle the ethics and opportunities of early-twenty- 
first-century science and technology and its impact on our future 
generations (see nature.com/futuregenerations). Gene editing, 
nuclear waste, climate change, the march of computers and popula- 
tion growth — decisions and paths embarked on today will resonate 
well into the future. 

Nature has long taken an interest in the fate of future generations 
and how science can improve — and endanger — them. Back in 
March 1870, an issue in the first volume of this journal carried a 
review of the book Hereditary Genius (Macmillan, 1869) by Francis 
Galton, who spawned the field of eugenics (see A. R. Wallace Nature 
1, 501-503; 1870). His book introduced claimed scientific concepts 
into what had previously been an economic and social debate about 
the relationship between present and future people. In Britain, this 


came during the era 
FUTURE GENERATIONS 


of friendly socie- 
A Nature special issue Hesseroups oflike- 
nature.com/futuregenerations 


minded people 
who — before 


welfare and insurance — would pay subscriptions while young, and 
(they hoped) receive benefits in old age, sickness and death. (In real- 
ity, and ina stark example of the pressures that still squeeze pension 
provision, many of these societies paid out more to older members 
than they could take from healthy young workers, and so went bust.) 
The concept of intergenerational equity in popular debate has 
since focused on finance, with environmental stability and sustain- 
ability tacked onto discussions only in the past 


“How do we few decades. The younger generations might 
best serve feel, quite legitimately, that they are getting a 
tomorrow’s raw deal. Just as many of the people who paid 
people?” into friendly societies never got a penny back, 


so the generation born around the turn of the 
millennium must look at the home-owning and financially secure 
baby-boomers and curse the timing of their births. Yet these are the 
young people who will, as they mature, be asked to make monu- 
mental decisions that affect not just one or two generations to come, 
but hundreds. 

As tools emerge that could eradicate the genetic basis for ill health, 
should they be used? When do nations abandon the (already shaky) 
attempts at collective action on climate change and make explicit 
their pursuit of pure self-interest? Just how do we dispose of drums 
of toxic waste that could remain hazardous for a million years? If the 
future starts tomorrow, then how do we best serve tomorrow's people? 

Perhaps there is a lesson in science fiction? Taking what we have 
and spinning it forward raises questions about the direction we head 
in — some of which are addressed in this special issue. And the best 
way to answer those questions is to work out, as best we can, where 
we, they — or if you are reading this in the future, you — want to end 
up. We start from here. m 


Climate changes 


The loss of three key facilitators must not 
impede progress on emissions mitigation. 


of the United Nations Framework Convention on Climate 

Change, she helped to lead a remarkable transition from nearly 
collapsed climate negotiations in Copenhagen in 2009, to an agreement 
between the world’s governments in Paris last year. She transcended her 
once-thankless — and largely powerless — post as facilitator-in-chief to 
become a popular and influential advocate for action on global warm- 
ing. Figueres has now announced that she will be stepping down in July. 
She will leave on a high note, but whoever fills her shoes will have to deal 
with significant head winds. 

Figueres’s departure, which became public knowledge on 19 Febru- 
ary, is part of a larger shake-up in the UN climate shop. On the same 
day, Héla Cheikhrouhou, executive director of the Green Climate Fund, 
which was created to help developing countries to reduce emissions 
and adapt to climate change, announced that she will leave her post at 
the end of her term in September. And on 15 February, former French 
foreign minister Laurent Fabius, who skilfully guided the negotiations to 
asmooth conclusion in December, announced that he is stepping down 
as president of the climate talks. French environment minister Ségoléne 
Royal will take his place until November, when the leadership transitions 
to Morocco at the next major meeting, in Marrakesh. 

In her letter to governments, Figueres lauded the Paris agreement as a 
historic achievement and said that the world is now transitioning into a 
phase of “urgent implementation” From a political perspective, it is cer- 
tainly true that the Paris agreement was historic. After all, there was no 
guarantee going into the meeting that anything at all would come out of 


C hristiana Figueres has charmed the world. As executive secretary 
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it, let alone the formal agreement that will be opened up for ratification 
on 22 April, Earth Day, this year. 

Both Figueres and Fabius deserve credit for making that happen, 
but their successors have plenty of work ahead. It is no secret that the 
actions that governments have committed to thus far fall well short of 
those needed to limit warming to 2°C, let alone to 1.5°C, which is the 
stated goal of the agreement. Nor is it clear that the world is urgently 
moving forward. 

The Green Climate Fund, which was created more than five years 
ago and has approved just eight projects, is still trying to collect the 
money promised by nations. The US Supreme Court has put US Presi- 
dent Barack Obama’ regulations for power-plant emissions on ice, 
pending a legal challenge. Policymakers in the United Kingdom are 
still debating how to proceed in the wake of a government decision last 
November — just before the climate talks got under way — to pull the 
plug on a programme supporting the development of carbon capture 
and sequestration technologies. And in another branch of the UN, the 
International Civil Aviation Organization has proposed a rule on aircraft 
emissions that is so weak as to be irrelevant. 

Nor is the Paris agreement a done deal: crucial details about the 
framework for monitoring commitments must still be negotiated. For 
instance, countries have yet to agree on precisely what kind of informa- 
tion they should submit to the UN. To track progress, build confidence 
and hopefully pave the way for more ambitious policies, scientists, 
environmentalists and governments need these data to be solid. Given 
that this objective is the only thing resembling accountability in an 
otherwise voluntary agreement, negotiations on this point could deter- 
mine whether the Paris agreement is indeed a success. 

Figueres was optimistic about the work to come. “The journey that 
lies ahead will require continued determination, 
ingenuity and, above all, our collective sense of 
humanity and purpose,’ she wrote to government 
officials. “I know that together you will again rise 
to the task.” Let us hope that she is right. = 
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policymaking in fields that take a long view, such as energy and 
the environment. It is typically used to invoke an ethical dimen- 
sion. But to whom do we refer when we speak about future genera- 
tions? This question is not often asked, and even less often answered. 

We need to define future generations carefully. Failure to do so leads 
to severe and potentially damaging shortcuts in deciding the legacy 
we choose to leave and debating its potential effects. 

A good example is the discussion of radioactive-waste management. 
Here, future generations are considered as a unified group of people: 
every generation that follows ours, with no attempt to constrain them 
by time. So, in talking about nuclear waste, the consideration of future 
generations mandates us to plan for the entire period that the material 
will be harmful — up toa million years, according to some estimates. 

That timeframe includes our grandchildren 
and those people — or other forms of life — who 
will share our planet hundreds of thousands of 
years from now. From an ethical point of view, 
it is meaningless to pretend that this is a homo- 
geneous group for which nuclear-waste policies 
will have the same impact. And trying to treat it 
as such distorts policy. 

One of the most important decisions to make 
when we plan what to do with our nuclear waste 
is whether we give people in the future the ability 
to overrule our decision and choose their own 
strategy. In a few thousand years, for example, 
a perfectly safe solution could be available. That 
argument demands that we put the waste somewhere from whence 
it can be retrieved. So, if we choose — as most nations have — to 
dispose of it in underground facilities, the ethical attitude towards 
future generations implies that we should not permanently seal off 
these disposal sites. 

At first glance, this strategy respects the autonomy of future 
generations and so is ethical. Take a closer look, however, and there is 
a point in the future when this ability of our successors to choose their 
own option becomes irrelevant. That autonomy benefits only those 
who know and remember that the waste is down there. Given that 
we struggle to answer questions about lost human civilizations from 
just a few thousand years ago, humility demands that we assume that 
knowledge of the location of disposed-of radioactive waste, and how 
to handle it, could be lost some time in the next million years. 

The French nuclear-safety authority assumes that this memory loss 
will not happen for at least 500 years. But clearly, much could happen 


between the years 
FUTURE GENERATIONS 


2500 and 1002000. 
A Nature special issue 


Some advocates of 
nature.com/futuregenerations 


Te concept of ‘future generations’ is common in research and 


retrievable waste 
options claim that 
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IT MAKES NO 


SENSE 


TO TALK OF 
‘FUTURE 
GENERATIONS’. 


Can today’s decisions 
really be future-proofed? 


The impact of long-term policies on future generations will differ over time, 
and policymaking should take this into account, argues Céline Kermisch. 


future generations will simply seal off the disposal before the memory 
is lost. Even if this is true — and we have no way of knowing — it does 
not solve the ethical problem of assuming that all future generations 
can be treated in the same way. In this case, the advantages of retriev- 
ability are relevant only to those generations alive before the disposal 
site is sealed. 

This example shows that it makes no sense to talk of ‘future 
generations. We should instead refer to two groups: ‘close future gen- 
erations’ and ‘remote future generations. How we define these groups 
depends on the problem being assessed. For radioactive-waste man- 
agement, remote generations are defined as those that have forgotten 
that the waste is there. The exact time in the future at which the mem- 
ory is lost is not a relevant question here. We assume it will happen, 
and plan accordingly. 

Addressing impacts on future generations 
in this way questions another claimed benefit 
of keeping nuclear waste retrievable — that the 
ability to monitor and maintain disposal sites is 
keeping them safe. But, given that it applies only 
to close future generations, the number of people 
protected by this policy is much smaller than is 
generally assumed. 

The same is true when talking about the 
security of the waste. Sealing off disposed radio- 
active material so that nobody — including ter- 
rorists — could access it protects fewer people 
than we might imagine. For remote future gen- 
erations, the security risk evaporates — every- 
body, including those who would use it for malevolent reasons, has 
forgotten that it is there. 

Just as there are intergenerational trade-offs between us and future 
generations, so there are similar conflicts between close and remote 
future generations. This applies to a range of long-term issues with 
impacts that vary with time. For instance, when analysing the fund- 
ing of nuclear-waste management from an ethical perspective, it is 
relevant to distinguish between generations that benefit from nuclear 
technologies and those that do not. 

This distinction could also be used to inform ethical debate on other 
long-term problems, including climate change. Remote future gen- 
erations could be, for example, those that live in a world in which the 
average temperature increase has surpassed 3 °C and sea levels have 
risen by a set amount. Their needs are then clearly framed as different 
from those of close future generations, which could remain fixated on 
trying to prevent a 2 °C rise — or persist in arguing about whether the 
threat of global warming is real. m 


Céline Kermisch works on the philosophy and ethics of risk at the 
Université Libre de Bruxelles, Brussels. 
e-mail: Celine.Kermisch@ulb.ac.be 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


CANCER BIOLOGY 


How exercise helps 
to combat cancer 


Mice that take exercise have 
fewer tumours than those 
that do not — thanks to more 
cancer-fighting immune cells 
finding their way into tumours. 
Studies in humans have 
linked the effects of taking 
regular exercise to a reduced 
risk of developing cancer. To 
investigate the underlying 
molecular mechanisms, 
Pernille Hojman of the 
University of Copenhagen in 
Denmark and her colleagues 
compared tumour growth in 
sedentary mice and in those 
that had access to an exercise 
wheel over four weeks. 
Animals that ran had about 
60% fewer tumours, which 
were also smaller in size. 
Exercise was associated with 
an increase in the number of 
a particular type of immune 
cell — the natural killer cell 
— found in the tumours. An 
exercise-induced surge in the 
hormone adrenaline mobilized 
these cells. 
Cell Metab. http://doi.org/bcenk 
(2016) 


MATERIALS 


Shells spark strong 
graphene fibre 


A composite thread with a 
structure inspired by nacre — 
the iridescent material found 
inside many seashells — is the 
strongest graphene-based fibre 
ever made. 


REGENERATIVE BIOLOGY 


Insulin from mini stomach 


Stomach tissue can be reprogrammed to mimic insulin- 
producing pancreatic cells and control diabetes when 


implanted into mice. 


A certain type of cell in one part of the stomach has similar 
gene-expression patterns to B-cells, which make insulin in the 
pancreas. Qiao Zhou at Harvard University in Cambridge, 
Massachusetts, and his colleagues reprogrammed these cells 
using a mix of key DNA-binding proteins, and found that more 
than 40% formed insulin-producing cells. The reprogrammed 
cells developed into stomach mini-organs (pictured) that, 
when transplanted into mice lacking B-cells, prevented spikes 
in blood sugar levels and kept animals alive for 6 months 
compared to untreated mice, which died within 8 weeks. 

The results suggest that the stomach could serve as a 
renewable source of B-cells to treat diabetes, the authors say. 
Cell Stem Cell http://doi.org/bcqc (2016) 


Such fibres are typically 
produced by spinning 
together nanometre-thick 
sheets of graphene oxide. 
But they often have poor 
tensile strength, probably 
owing to weak interactions 
between the nanosheets. 

To strengthen the threads, 
Qunfeng Cheng of Beihang 
University in Beijing and his 
colleagues added two more 
ingredients: calcium ions and 
a flexible carbon compound 
called PCDO. These help 

to bind the nanosheets 
together, mimicking the 
strong interactions in nacre. 
The team produced a fibre 
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roughly 20 micrometres wide 
(pictured) that could be tied in 
a knot without fracturing and 
could support a 2-gram weight. 

The fibres also conduct 
electricity, making them a 
promising material for flexible 
electrodes or artificial muscles. 
Adv. Mater. http://doi.org/f3k23w 
(2016) 


Habitats of ancient 
humans revealed 


Some hominins may have 
preferred to live in shady 
glades near fresh water nearly 


2 million years ago. 

Clayton Magill at the Swiss 
Federal Institute of Technology 
in Zurich and his colleagues 
took samples from a layer of soil 
in Olduvai Gorge, Tanzania, 
where fossil remains for early 
Homo and Paranthropus boisei 
hominins have been found. 
This 1.8-million-year-old 
soil was covered bya layer of 
volcanic ash, which preserved 
the distinctive chemical 
signatures left behind by 
ancient plants. By analysing 
these ‘biomarkers; the team was 
able to distinguish aquatic from 
terrestrial plants, grasses from 
non-grasses, and woody plants 
from herbaceous ones. 

The biomarker analysis and 
fossil remains suggest that the 
hominins favoured a small 
wooded area near a freshwater 
wetland, presumably because it 
offered drinking water, edible 
plants and shade. The presence 
of fossilized butchered animal 
remains also implies that the 
hominins brought food back to 
their woodland home from the 
larger grassland area. 

Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1507055113 (2016) 


Gut microbes help 
malnutrition 


Manipulating the gut microbes 
of undernourished children 
could help them to gain weight, 
three laboratory studies in 
mice and pigs suggest. 
Malnourished children can 
struggle to gain weight even 
on high-nutrient diets, and 
studies have suggested that 
under-nutrition stops their gut 
microbiomes from maturing. 
A team led by Jeffrey Gordon 
at Washington University 
in St. Louis, Missouri, 
transplanted gut microbes 
from undernourished and 
healthy children from Malawi 
into germ-free mice, and found 
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that mice given microbes from 
healthy children gained more 
weight and muscle than did 
mice with the malnourished 
microbiomes. The team 
identified microbial species 
associated with these gains, 
and delivering two species 
to the guts of mice that had 
malnourished microbiomes 
boosted the animals’ growth. 
Martin Schwarzer and 
Francois Leulier at the 
University of Lyons, France, 
and their team found that 
giving malnourished mice 
a strain of Lactobacillus 
plantarum bacteria helped 
the mice to gain weight 
by restoring their growth 
hormone production. Ina 
third study, Gordon and his 
colleagues identified a sugar 
in breast milk that promoted 
growth in mice and piglets 
harbouring microbes froma 
malnourished child. 
Science http://doi.org/benr 
(2016); Science 351, 854-857 
(2016); Cell http://doi.org/ 
bcqd (2016) 


ANIMAL COGNITION 


Horses read 
human emotions 


Horses can differentiate 
between happy and angry 
human faces. 

Researchers previously 
showed that dogs can identify 
emotions from human faces. 
To find out whether horses 
share this ability, Amy Smith, 
Karen McComb and their 
colleagues at the University of 
Sussex in Brighton, UK, tested 
the response of domestic horses 


(Equus caballus; pictured) 

to photographs of human 
faces with happy or angry 
expressions. Horses tended to 
view angry faces with their left 
eyes — a sign that they were 
processing the image using 
the brain’s right hemisphere, 
which is thought to handle 
negative stimuli. The animals’ 
heart rates also increased more 
rapidly in response to angry 
faces than to happy ones. 

The ability of horses to 
recognize human expressions 
could have evolved during 
domestication, the authors 
say. They add that the animals 
probably also refine this skill 
during their lifetimes. 

Biol. Lett. 12, 20150907 (2016) 


First super-Earth 
atmosphere 


The first gases to be identified 
around an exoplanet that is 
slightly larger than Earth show 
that its atmosphere is probably 
rich in hydrogen and carbon. 

Angelos Tsiaras of 
University College London 
and his colleagues used a 
camera on the Hubble Space 
Telescope to probe the planet 
55 Cancri e, which has a 
radius twice that of Earth, 
lies 12 parsecs (about 40 light 
years) away and orbits close to 
its host star. 

The team found that the 
planet's atmosphere contains 
no water vapour but does have 
hydrogen, probably in the form 
of hydrogen cyanide, which 
is an indicator of carbon-rich 
atmospheres. The high level 
of carbon suggests exotic 
chemistry around the planet. 
Astrophys. J. in the press; 
preprint at http://arxiv.org/ 
abs/1511.08901v2 (2016) 


Climate risks of 
low-carbon power 


A transition to power plants 
that capture and store carbon 
could increase water use, 
probably leading to shortages 
in a major UK river basin as 
early as the 2030s. 


RESEARCH HIGHLIGHTS MiiiSaiaa¢ 


Carbon capture and storage 
(CCS) technologies that 
minimize emissions require 
a lot of water for cooling, but 
climate change is predicted 
to lead to drier summers 
in the United Kingdom. 
Edward Byers of Newcastle 
University, UK, and his 
colleagues used climate and 
hydrology models to analyse 
scenarios in which electricity 
production rises by 55% by 
2040 as CCS technologies 
are fully adopted in the River 
Trent basin — the largest 
UK inland source of cooling 
water for power generation. 
The team projected declining 
river flows alongside rising 
demand for water from the 
power sector. 

The researchers found 
that CCS power plants might 
become less reliable in the 
future when the river runs 
low, unless the most water- 
efficient technologies are 
used. 
Environ. Res. Lett. 11,024011 
(2016) 


| NANOSCIENCE 
Nanorotor made 
of DNA 


Researchers have made a 
nanometre-scale rotor out of 
3D fragments of DNA. 

Hendrik Dietz and his 
colleagues at the Technical 
University of Munich, 
Germany, designed the 
fragments to self-assemble 
into a rotor that looks like a 
helicopter blade (pictured). 
It spins around on an axle and 
is clamped in place by two 
other DNA units. To stop the 
blade from rotating freely, 
the team used docking sites 
on the inside of the clamp 
units so that the rotor was 
held in place. When the ion 
concentration of the solution 
containing the rotor changed, 
the rotor was released and 
could spin owing to Brownian 
motion. 

The researchers say this 


action mimics that of an 
enzyme in the body that acts 
like a rotary machine, and 
claim that their nanomachine 
is more structurally complex 
than previous ones. 

Sci. Adv. 2,e1501209 (2016) 


ROBOTICS 


Microbots dance 
in the light 


Tiny soft robots can swim, 
spin and make other complex 
motions in response to light. 

Microbots — micrometre- 
scale robots — could one day 
be used to deliver medicines or 
to collect data inside the body. 
But most microbots are made 
of rigid materials, which limit 
them to simple back-and-forth 
motions, usually in response to 
chemical or magnetic forces. 
To develop microbots that 
are capable of more-versatile 
movement, Peer Fischer at 
the Max Planck Institute 
for Intelligent Systems in 
Stuttgart, Germany, and his 
colleagues designed soft, 
flexible ones using a liquid- 
crystal elastomer that was 
both rubbery and responsive 
to light. 

By sweeping bands of 
light across disc- and rod- 
shaped microbots, the team 
induced wave-like swimming 
motions similar to those used 
by single-celled organisms. 
Different patterns of light 
also made the disc-shaped 
microbots spin, reverse their 
spinning direction or move 
around a path in the shape of 
a square. 
Nature Mater. http://doi.org/bcjk 
(2016) 
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EVEN DAYS esesns 


EVENTS 


Earthquake action 
Citing a sharp spike in 
earthquake activity, a US 
environmental group has 
filed a federal lawsuit seeking 
to curb the injection of 
wastewater from hydraulic 
fracturing (fracking) and 

oil activities in Oklahoma. 
The California-based Sierra 
Club, working with law 

firm Public Justice, filed 

the lawsuit against 3 energy 
companies on 16 February, 

3 days after a magnitude-5.1 
earthquake shook the state. 
There were 907 earthquakes 
of magnitude 3 or larger in 
2015, compared with 109 in 
2013. Scientists have linked the 
earthquakes to the injection 
of wastewater, a by-product of 
fracking. 


ExxonMobil anger 


More than 100 scientists have 
signed a letter urging the 
American Geophysical Union 
(AGU) to reject sponsorship 
from US oil and gas giant 
ExxonMobil. Following 

news reports in late 2015 that 
indicated that ExxonMobil 
scientists knew about the 
threat of global warming 
decades ago, the letter says that 
ExxonMobil’ complicity in 
spreading “climate denial and 
misinformation” has been well 
documented. “By allowing 
Exxon to appropriate AGU’s 
institutional social license to 
help legitimize the company’s 
climate misinformation, AGU 
is undermining its stated 
values as well as the work of 
many of its own members,’ the 
22 February letter states. 


January heats up 


Last month was the world’s 
hottest January since records 
began in 1880, and the ninth 
month in a row to breaka 
global monthly temperature 
record, the US National 
Oceanic and Atmospheric 


Sentinel-3A launches to monitor Earth 


The European Space Agency successfully 
launched the Earth-observing satellite 
Sentinel-3A on 16 February. The launch, 
from Plesetsk, Russia, carried a suite of four 
Earth-observing instruments into orbit as 
part of Europe’s Copernicus programme. 
Sentinel-3A is one of two identical satellites, 


Administration reported on 
17 February. The average 
global temperature was 1.04°C 
above the twentieth-century 
average for January, beating 
the previous record, from 
2007, by 0.16 °C. In the Arctic, 
which was remarkably warm 
for the time of year, sea ice was 
at its lowest January extent 
since records began in 1979, 
according to the US National 
Snow and Ice Data Center. 


Pp PEQPLE 
New NAS president 


Geophysicist Marcia McNutt 
has been elected president of 
the US National Academy of 
Sciences (NAS) by its members, 
the society announced on 

16 February. McNutt is 
currently editor-in-chief 

of Science and its family of 
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and 2015. 


journals. She begins her 
six-year term at the NAS on 

1 July, replacing atmospheric 
scientist Ralph Cicerone, who 
has served the maximum two 
terms. McNutt was director of 
the US Geological Survey from 
2009 to 2013. 


FDA vote due 


As Nature went to press, the 
US Senate was poised to vote 
on the nominee to head the US 
Food and Drug Administration 
(FDA), clinical-trials expert 
Robert Califf. On 22 February, 
lawmakers voted to limit 
debate on Califf’s nomination, 
setting the stage for a final 

vote that will determine 
whether he will become FDA 
commissioner. President 
Barack Obama nominated 
Califf last September, but 
some lawmakers delayed 


the second of which will launch in 2017. Over 
the oceans, they will measure sea surface 
temperatures and sea-ice thickness, and over 
land they will map how land is used, the extent 
of vegetation and the height of rivers and lakes. 
Two other Sentinel satellites launched in 2014 


consideration of the 
nomination over concerns 
about the FDAs recent 
approval of genetically 
engineered salmon for use as 
food and the agency's handling 
of an ongoing opioid-approval 
crisis. See go.nature.com/ 
mfdn9u for more. 


Italy research chief 
Physicist Massimo Inguscio 
took office on 22 February as 
president of Italy’s National 
Research Council (CNR). 
Inguscio, who comes to the 
post from the University of 
Florence, will need to ensure 
that research in the 103 CNR 
institutes can stay on track 
without significant institutional 
funds — salaries and general 
expenses absorb most of the 
CNRss budget. He will also need 
to maintain the organization’s 
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commitment to research 
integrity: the CNR is currently 
working its way through several 
cases of potential misconduct 
in papers co-published by its 
scientists. 


More KI fallout 


Heads continue to roll in 

the scandal rocking the 
Karolinska Institute (KI) in 
Stockholm: Hans-Gustaf 
Ljunggren resigned as dean 

of research on 22 February. 

He had participated in the 
decision to clear surgeon Paolo 
Macchiarini of allegations 

of misconduct in carrying 

out experimental transplants 
of artificial windpipes into 
patients. Earlier this month, 
two other KI professors 
resigned as vice-chancellor 

of the university and as 
secretary-general of the Nobel 
Assembly that chooses the 
Nobel physiology or medicine 
prize. The KI and its hospital 
have opened new investigations 
into Macchiarini, and Swedish 
health authorities have referred 
the matter to the police. 


Figueres bows out 


Christiana Figueres (pictured) 
will step down in July after 

her six-year term at the 

helm of the United Nations 
Framework Convention on 
Climate Change comes to an 
end. The Costa Rican diplomat 
had a key role in international 
climate negotiations, which 
culminated in the landmark 


TREND WATCH 


Only 36% of clinical trials 
completed in 2007-10 at 

51 leading US institutions 
were published or reported 
on ClinicalTrials.gov within 
2 years of the trials finishing, 


according to an analysis in the 
BMJ (R. Chen et al. Br. Med J. 
http://doi.org/bcqf; 2016). The 
University of Minnesota, which 
ranked 15th in terms of number 
of trials, was most prompt at 
reporting results, with 55% of 
trials reported within two years. 
See go.nature.com/9cwl19a for 
more. 


—— 


emissions-reduction agreement 
reached at the Paris climate 
talks in December last year. 
Figueres’s yet-to-be-named. 
successor will be charged with 
overseeing the implementation 
of the agreement, details of 
which are to be discussed at the 
UN Conference of the Parties 
in Morocco in November. See 
page 382 for more. 


Stem-cell leader 


Stem-cell biologist Robert 
Lanza announced on 

22 February that he will be the 
head of global regenerative 
medicine for Astellas Pharma, 
the Tokyo-based firm that 
acquired Lanza’s company 
Ocata Therapeutics earlier 
this month. Lanza is the chief 
scientific officer for Ocata, 
based in Marlborough, 
Massachusetts, which has been 
a leader in developing stem- 
cell therapies, despite financial 
troubles and management 
changes over the past 15 years 
(see Nature 481, 130-133; 


2012). Ocata is running clinical 
trials to test stem-cell-derived 
retinal cells on vision-loss 
disorders. Lanza says that those 
trials and other ongoing Ocata 
research programmes will 
continue under Astellas. 


LIGO-India is go 


India’s cabinet approved plans 
to build a gravitational-wave 
detector on 17 February. The 
machine, LIGO- India, will 

be a copy of two LIGO (Laser 
Interferometer Gravitational- 
Wave Observatory) machines 
in the United States, which this 
month reported the first direct 
detection of these long-sought 
ripples in space-time. Having a 
third machine on the other side 
of Earth should help physicists 
to pinpoint the sources of 
gravitational waves. The site 
for the 12.6-billion-rupee 
(US$183-million) project has 
yet to be selected, but it could 
be operational by 2023. 


Space panorama 
NASA is officially moving 
ahead with its next flagship 
astrophysics observatory, the 
Wide Field Infrared Survey 
Telescope (WFIRST), the 
agency stated on 18 February. 
Set to launch in the mid-2020s, 
WEIRST will use a 2.4-metre 
telescope repurposed from the 
US National Reconnaissance 
Office to capture the sky in 
infrared wavelengths. With 


THE TRIAL-PUBLISHING PROBLEM 


Results from many clinical trials completed in 2007-10 at 
US institutions were unpublished or unreported two years after the 
studies had finished. (Top ten institutions listed by number of trials.) 
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SEVEN DAYS | THIS WEEK | 


25-26 FEBRUARY 
Potential marine 
World Heritage sites 
in the Arctic will be 
discussed at UNESCO 
headquarters in Paris. 
go.nature.com/crvleg 


26-28 FEBRUARY 
Mumbai in India 

hosts the American 
Association for Cancer 
Research's conference 
on new ideas in cancer. 
go.nature.com/9mf9wt 


29 FEB-2 MARCH 
NASA invites the 
astrophysics community 
to Pasadena, California, 
to discuss the science 
that can be done with 
the Wide Field Infrared 
Survey Telescope, which 
will study the nature of 
dark energy and search 
for exoplanets. 
go.nature.com/chvjtq 


its wide-range view, it will be 
able to measure light from one 
billion galaxies and illuminate 
the nature of the dark energy 
thought to be accelerating the 
expansion of the Universe. 


Poor policy stances 


Of more than 200 institutions 
that have produced public 
policy statements on animal 
research, most do not meet 

the highest standards of 
openness, according to the 
group Speaking of Research, 
which campaigns to improve 
understanding of animal work. 
The group analysed policy 
statements from institutions 

in Europe, North America and 
Australia and rated them on the 
level of detail provided, such 

as welfare information and 
images. Only ten institutions 
got full marks, the group said 
on 22 February. 
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The plant Artemisia annua (pictured being harvested in Tanzania) was the only source of artemisinin before biochemists invented a synthetic route. 


PUBLIC HEALTH 


Synthetic malaria drug 
meets market resistance 


First commercial deployment of synthetic biology for medicine has modest impact. 


BY MARK PEPLOW 


hen Paris-based pharmaceutical 
giant Sanofi started to sell malaria 
drugs made with the help of geneti- 


cally engineered yeast in 2014, the move was 
hailed as a triumph for synthetic biology. The 
yeast was fermented in a vat to produce a chemi- 
cal that Sanofi converted into artemisinin, which 
is used to make leading malaria treatments 


called artemisinin-based combination therapies 
(ACTs). Many hoped that the process would 
offer a cheap and plentiful supply of drugs to 
tackle a disease that claims almost half a million 
lives worldwide every year. 

Yet Sanofi produced no ‘semi-synthetic 
artemisinin (SSA) at all in 2015, Nature has 
learned. And the company is now selling the 
manufacturing site in Garessio, Italy, where it 
made its SSA. 


That such celebrated drugmaking 
technology — developed with the help of 
US$64 million from the Bill & Melinda Gates 
Foundation — stands idle illustrates the com- 
plicated web of economic forces that affects 
the market for malaria drugs. “This is a perfect 
example of how a new manufacturing process 
becomes extremely hard to scale up when 
there is a complex ecosystem of players,’ says 
Prashant Yadav, a health-policy researcher 
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| NEWS IN FOCUS 


Artemisinin-based combination therapy pills (shown here in China) are leading treatments for malaria. 


> at the William Davidson Institute at the 
University of Michigan, Ann Arbor, who studies 
the ACT market. 

Before the advent of SSA, the only source 
of artemisinin was the sweet wormwood plant 
(Artemisia annua), the discovery of which won 
Chinese scientist Youyou Tua share of the 2015 
Nobel Prize in Physiology or Medicine. But the 
agricultural supply has been erratic. Short- 
ages of A. annua send prices soaring, which 
attracts more farmers to plant it; their produce 
then swamps the market, depressing prices 
and triggering fresh shortages (see ‘A stable 
artemisinin market?’). 


SUPPLEMENT OR SAVIOUR? 

The synthetic-biology route promised to 
end this rollercoaster by providing a stable 
and reliable source of artemisinin. Sanofi 
developed the capacity to produce almost 
60 tonnes of the chemical per year — about 
one-third of global need — and the company 
hoped to supply other ACT manufacturers 
with the raw materials. 

“In reality, that has not happened,” says 
Yadav. Sanofi has so far used its SSA to make 
more than 39 million treatments of its own 
version of ACT — representing about 10% of 
global ACT demand — but has not sold the 
chemical to other drugmakers. 

That is partly because of a glut in 
agricultural artemisinin. For the past two 
years, the naturally derived chemical has 
sold for less than $250 per kilogram — below 
Sanofi’s ‘no profit-no loss’ margin of around 
$350-400 per kilogram. “Tf that price is already 
very low and there's a bumper crop, there's no 
reason to fire up a fermenter,’ says Jay Keasling 
of the University of California, Berkeley, who 
led the team that first developed the yeast strain. 

But ACT manufacturers such as 


China's Guilin Pharma and India’s Cipla are also 
reluctant to buy their drug ingredients from 
Sanofi, says Yadav, because the company is a 
direct competitor in the ACT market. 

And Sanofi has not found it worthwhile to 
increase production of its own ACT because 
demand has plateaued. This is in part the result 
of growing efforts to diagnose malaria before 
doling out medicine: malaria treatments are 
often taken by people with fevers who do not 
actually have malaria, so more-accurate diag- 
noses help to reduce the number of treatments 
needed. Whether demand will rise again will 
depend on how international efforts to tackle 
malaria develop in the future, and how much 
funding will be available to purchase ACTs. 

By July, Sanofi will complete the sale of its 
Garessio manufacturing plant to Bulgarian 
company Huvepharma, a contract manufacturer 


A STABLE ARTEMISININ MARKET? 


After a decade of instability, prices of the 
malaria drug artemisinin have dropped 
and demand has stopped rising. 
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responsible for fermenting the engineered 
yeast in vats to make artemisinic acid — the 
precursor to artemisinin — for Sanofi. 

Nicola de Risi, a manager for Huvepharma in 
Rome who will head the firm’ Italian division, 
hopes that by gaining control of the entire SSA 
production process (from yeast to final prod- 
uct), the company will be able to lower costs 
and make sales to other ACT manufacturers. 
But Huvepharma will switch to using plant- 
derived artemisinin if it cannot make SSA 
cost-competitive, de Risi says. 

PATH, a global-health organization based 
in Seattle, Washington, which coordinated the 
development of SSA, says that it still consid- 
ers the project a success. “Since SSA entered 
the market, we have observed better price sta- 
bility, and there has been adequate supply of 
artemisinin, it said in a statement. 

“There is merit to the argument that SSA 
has contributed somewhat to stabilizing 
prices,’ says Yadav. But the main causes of price 
stability, he adds, are the recent steady demand 
for ACTs and long-term purchasing con- 
tracts with ACT manufacturers, set up by 
the Global Fund to Fight AIDS, Tuberculosis 
and Malaria. 

PATH and Keasling say that SSA was always 
intended to be a supplemental source to fill 
gaps in agricultural production, or to cope 
with spikes in demand. But Claire Marris, a 
sociologist of science at City University Lon- 
don who previously worked at the Centre for 
Synthetic Biology and Innovation at Imperial 
College London, says that in her experience, 
SSA is often portrayed by those working in 
the field as simply a low-cost, high-volume 
substitute for agricultural artemisinin. “It was 
constantly talked about,’ she says. Now, Marris 
worries that unrealistic expectations for SSA’s 
achievements could damage public trust in 
synthetic biology. 

When the Gates Foundation awarded the 
first ofits grants for the SSA project in 2004, it 
explicitly aimed to lower the cost of each ACT 
treatment from $2.40 to “well under a dol- 
lar”. But the median price of Sanofi’s ACT had 
already dipped to $0.92 per adult treatment by 
2012, well before the introduction of SSA, and 
it has changed little since then. 

De Risi says that SSA production will restart 
later this year so that Sanofi can produce its 
own ACT treatment. “I think it’s good for syn- 
thetic artemisinin,’ says Yadav, who points out 
that other ACT producers may be more willing 
to buy artemisinin from Huvepharma because 
it is not an ACT producer itself — and there- 
fore not a direct competitor. 

Meanwhile, Guilin Pharma and Cipla 
are making plans to develop their own SSA, 
and Keasling hopes that more research and 
development work could make the synthetic 
process cheaper in the long term. “Td like to 
see SSA take over as the dominant form, and 
some day I think it will,” says Keasling. “But we 
have to be patient” = 
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New cancer models could help scientists to devise better treatments. 


BIOMEDICAL SCIENCE 


US cancer institute 
overhauls cell lines 


Veteran cells to be replaced by human tumours grown in mice. 


BY HEIDI LEDFORD 


fter more than 25 years of heavy use by 
Ase: around the world, the US 

National Cancer Institute (NCI) has 
decided to stop screening most drugs using the 
NCI-60, its panel of 60 human cancer cell lines 
grown in culture. In late spring of this year, the 
institute will launch a rejuvenated repository 
of cancer models that are derived from fresh 
patient samples and tagged with details about 
their clinical past. 

The NCI action responds to a widespread 
push for cancer models with a closer link to the 
patients they are intended to help. On 11 Feb- 
ruary, cancer researchers gathered in New 
Orleans, Louisiana, for a meeting hosted by 
the American Association for Cancer Research 
that focused on the creation of new models 
from clinical samples. 

Since 1990, industry and academia have 
screened more than 100,000 compounds using 
the NCI-60, in order to study the molecular 
details of cancers. 

When the NCI-60 was established, research- 
ers had a very different conception of cancer, 
says James Doroshow, director of the Division 
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of Cancer Treatment and Diagnosis at the NCI 
in Bethesda, Maryland. “Thirty years ago, the 
idea was that if you found a drug that worked 
on six breast cancer cell lines, then you could 
use it to treat breast cancer,’ he says. “Well, it 
doesn’t work that way.” 

Since then, breast cancer has been broken 
down into subcategories that are based on 
genetic mutations — and each category may 
respond differently to treatment. 

The NCI-60 cell lines have also lived for 
thousands of generations in culture. Over 
time, this has altered their genetic make-up 
and behaviour. 

The NCI will continue to supply the NCI-60 
lines to researchers, but will eventually refocus 
its drug screening on newer models. It is devel- 
oping hundreds of ‘patient-derived xenografts’ 
(PDXs) by implanting small chunks of human 
tumours in mice — an environment that better 
mimics the human body. The tumours can then 
be harvested and reimplanted in other mice, 
allowing researchers to study a given tumour 
in multiple animals. The NCI will distrib- 
ute cells from those PDXs, plus data on each 
tumour’s genome sequence and gene-expres- 
sion patterns, and the donor's treatment history. 


IN FOCUS | NEWS 


The institute will also make cell lines from 
the samples for use in more-detailed biochem- 
ical studies and in drug screening. And it is 
developing cell cultures and xenografts from 
tumour cells that are circulating in the blood to 
model tumours that are difficult for surgeons 
to biopsy. Doroshow estimates that his team 
will have 75 models ready for public distribu- 
tion when the repository opens; the group aims 
to produce 1,000 in the first phase. 

The NCI effort reflects a wider trend. Sixteen 
European institutions have formed EurOPDX, 
a consortium that boasts 1,500 PDXs. The 
Jackson Laboratory, a non-profit company in 
Bar Harbor, Maine, has 450 PDXs, and another 
100 in development. Many more reside in 
pharmaceutical companies: last year, the Swiss 
pharma giant Novartis published a drug screen 
using 1,000 PDXs (H. Gao et al. Nature Med. 
21, 1318-1325; 2015). 

PDXs have also garnered attention as models 
to guide treatment of individual patients: mice 
bearing PDXs could serve as ‘avatars’ to allow 
physicians to screen for the most-effective 
treatment regimen. But the process of gen- 
erating a PDX is often too slow to benefit the 
donor, says Edison Liu, chief executive of the 
Jackson Laboratory. Instead, Liu sees Novartis’s 
approach — studying large collections of PDXs 
to help future patients — as more promising. 

Such models can capture the genetic 
complexity of human cancers better than old 
cell cultures or genetically engineered mice, 
but PDXs also have shortcomings. Most are 
generated in mice that lack normal immune 
responses, to prevent rejection of the human 
cells. Efforts are under way to engineer mice 
with aspects of the human immune system, 
but no mouse fully captures the complexity of 
the system. 

Despite the limitations, some researchers 
have translated PDX results into clinical 
gains. Livio Trusolino, a cancer researcher at 
the University of Turin in Italy, and his col- 
leagues mined their collection of 600 colorec- 
tal cancer PDXs. They found that PDXs from 
some drug-resistant tumours responded better 
to a combination of treatments normally used 
against breast cancer — a result that was then 
borne out in a small clinical trial, Trusolino 
announced at the meeting in New Orleans. 

”For the first time in my life, my results have 
been translated into a benefit for patients,” 
Trusolino says. “It is very rewarding” = 
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Future-proofing 
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the genomes of 
future generations? 
nature.com/nature/ 
podcast 


25 FEBRUARY 2016 | VOL 530 | NATURE | 391 


© 2016 Macmillan Publishers Limited. All rights reserved 


CONSERVATION 
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Logging in the Brazilian Amazon has decreased in the past decade, but is still a problem. 
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Satellite alerts track 
deforestation in real time 


System uses Landsat data to issue warnings just hours after tree loss is detected. 


BY GABRIEL POPKIN 


satellite-based alert system could prove 
A: potent weapon in the fight against 

deforestation. As few as eight hours 
after it detects that trees are being cut down, 
the system will send out e-mails warning that 
an area is endangered. That rapid response 
could enable environmental managers to 
catch illegal loggers before they damage large 
swathes of forest. 

“Tt’s going to be very, very helpful, says Brian 
Zutta Salazar, a remote-sensing scientist at 
the Peruvian Ministry of the Environment in 
Lima. 

Satellites are already valuable tools for 
monitoring deforestation; in recent decades, 
they have delivered consistent data on forest 
change over large and often remote areas. One 
such effort, the Real Time System for Detec- 
tion of Deforestation, or DETER, has helped 
Brazil's government to reduce its deforestation 
rate by almost 80% since 2004, by alerting the 
country’s environmental police to large-scale 
forest clearing. 

But DETER and other existing alert systems 
can be relatively slow to yield useful informa- 
tion. They use data from the Moderate Resolu- 
tion Imaging Spectroradiometer (MODIS) on 


Site of deforestation in Peru as detected using 
MODIS data (top) and Landsat data (bottom). 


NASA’ Terra satellite, which at its top resolu- 
tion produces images with pixels covering an 
area 250 metres on each side, roughly equiva- 
lent to 10 football pitches. This is too big to 
spot small changes in land cover, so it can take 
computer programs that process MODIS data 
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weeks or even months to detect that a forest is 
being cleared. “By the time MODIS picks up on 
it, it's almost too late,” says Peter Ellis, a forest- 
carbon scientist at the Nature Conservancy, a 
conservation group in Arlington, Virginia. 

Seeking to provide a sharper view, geog- 
rapher Matthew Hansen of the University of 
Maryland in College Park and his colleagues 
published maps showing year-to-year changes 
in global forest cover from 2000 to 2012 
(M. C. Hansen et al. Science 342, 850-853; 
2013). The researchers relied on data from 
NASA’s two active Landsat satellites, which — 
together photograph every spot on Earth 
every eight days. Each pixel in a Landsat image 
is 30 metres on each side — roughly the size of 
a baseball diamond. This means that an area 
covered by just one MODIS pixel is captured 
in roughly 70 smaller Landsat pixels. 

Hansen and his team wrote data-processing 
software that can use these higher-resolution 
images to recognize a disturbance as small 
as a road snaking its way through a previ- 
ously untouched forest, something that often 
appears before clear-cutting begins. “It's much 
more advantageous to detect a smaller clearing 
first, and to figure out what that is, compared 
to finding something large,” says Salazar. 

Using this system, Hansen and his 
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GOOGLE/NASA/USGS VIA GLOBAL FOREST WATCH 


colleagues plan to start updating the 
forest-change maps on their website more 
frequently — just hours after new defor- 
estation is detected. A paper detailing the 
team’s methodology has been accepted for 
publication in Environmental Research 
Letters. 

The World Resources Institute, an 
environmental group in Washington DC, 
will provide access to the deforestation 
alerts on its Global Forest Watch website, 
beginning in early March. When trees 
disappear between successive Landsat 
passes, a pixel representing that location 
on an online map will turn red. Users will 
also be able to sign up to receive e-mails 
when land cover changes in a specific 
area, such as a park, an indigenous ter- 
ritory or a privately owned forest, which 
could facilitate rapid responses to small- 
scale deforestation. 

But the alert system is limited in some 
important ways. Landsat detects only vis- 
ible light and short-wave infrared, so it 
cannot see through clouds. This means 
that in the planet’s cloudiest areas — such 
as many tropical rainforests — the satel- 
lites could go months without collecting 
images of the underlying land. Nor can the 
probes reliably detect damage from activi- 
ties that leave the forest canopy intact, 
such as selective logging and the gather- 
ing of wood for fuel. 

Because such forest degradation can 
yield carbon emissions that approach half 
those from all-out deforestation, tracking 
this damage is important in understand- 
ing the role of forests in climate change, 
says Alessandro Baccini, a remote-sensing 
expert at the Woods Hole Research Center 
in Falmouth, Massachusetts. 

And then there is the difficulty of ensur- 
ing that deforestation alerts reach not only 
government officials, but also people who 
live in or near affected forests. “We will 
have faster detection, but OK, so what, 
what do you do next?” says Carlos Souza, 
aresearch scientist at Imazon, an environ- 
mental research institute in Belém, Brazil. 
He would like to see residents of remote 
forests trained to check the veracity of 
satellite-driven alerts on the ground. That 
could also help countries to improve their 
estimates of greenhouse-gas emissions 
from tree loss. 

In the meantime, the Landsat-based 
alerts represent a significant step forward 
for the fight against deforestation, says 
Frances Seymour, a senior fellow at the 
Center for Global Development, a think 
tank in Washington DC. “In the context of 
law enforcement, timeliness is everything,” 
she says. “A couple weeks later, not only is 
the forest gone, but so is the equipment and 
all the evidence you might be able to use for 
asuccessful prosecution.” m 
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European bison in Biatowieza Forest, where Poland’s government is considering increased logging. 


Pristine forest at risk 


Researchers suspect motives for a logging proposal are 
commercial, but forest administration cites pest control. 


BY QUIRIN SCHIERMEIER 


Polish proposal to increase logging in 
A« ancient Bialowieza Forest is drawing 

fresh criticism from scientists. They sus- 
pect that the motives are partly commercial, and 
dispute claims that an outbreak of bark beetle 
threatens the forest. The Polish Ministry of the 
Environment says that there is no commercial 
benefit to the proposed logging and insists that 
it is needed for pest control. 

The 1,500-square-kilometre forest, which 
straddles the Poland-—Belarus border, has 
remained largely unchanged for centuries, 
making it a matchless stomping ground for 
researchers tracing the behaviour and ecol- 
ogy of insects, birds and mammals, including 
the largest population of European bison 
(Bison bonasus). 

It is also a source of ecological measure- 
ments, for example on regeneration after 
disturbances, that inform forest management 
elsewhere, says Rafat Kowalczyk, director of 
the Polish Academy of Sciences’ Mammal 
Research Institute in the village of Bialowieza. 

A Bialowieza management plan limits log- 
ging in the forest to 48,000 cubic metres of 
wood per year — enough to allow locals to 
gather firewood. But on 10 November, the local 
forest administration proposed an amend- 
ment that would allow large-scale logging in 
sections outside the central 17% of the forest 
that is a national park. They cited an outbreak 
of the bark beetle pest (Ips typographus) in 
Bialowieza’s Norway spruce (Picea abies). In 
one forest district where logging is currently 
limited to 6,000 m’ per year, the allowable 
yearly volume would increase to 53,000 m’. 

On 18 November, scientists with Poland’s 
State Council for Nature Conservation 


condemned the proposal; public protests have 
followed. This week in Nature, Polish biologists 
express other concerns in two Correspondence 
articles (P. Chylarecki and N. Selva Nature 530, 
419; 2016; P. Michalak Nature 530, 419; 2016). 

Conservation council member and Cor- 
respondence author Przemystaw Chylarecki, 
who is an ornithologist at the Museum and 
Institute of Zoology in Warsaw, suspects that 
commercial considerations, not just pest 
control, are behind the plan. Poland’s gov- 
ernment was elected in October — and the 
environment minister referred to the wasted 
commercial potential of unlogged trees in his 
election campaign, notes Chylarecki. 

But an environment ministry spokesman, 
Jacek Krzeminski, says that there is no commer- 
cial incentive because the wood is only good for 
firewood, and the costs of logging and transport 
make it unprofitable to sell the wood on. 

Kowalczyk, who also opposes the logging 
proposal, says that the pest-control argument is 
misguided. Recurring bark-beetle outbreaks do 
not endanger the forest at large because more- 
resilient tree species spread and replace spruce, 
he says. “That’s a perfectly natural process and 
endlessly preferable to cutting down trees.” But 
Jarostaw Krawczyk, spokesman for the regional 
state forest directorate in Bialystok, says that the 
current outbreak is unprecedented in scale and 
has already begun to attack other tree species. 

A detailed assessment of forest health is under 
way, says Krzeminski. Earlier this month, a 
regional environment agency suggested that 
the amount of extra logging be reduced to half 
of the volume proposed in the new management 
plan, whereas Poland’s national forest authority 
has yet to weigh in. Depending on its opinion, 
the ministry will decide on the amendment later 
this year, Krzeminski told Nature. = 
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| NEWS IN FOCUS 


Scientists 
oppose monkey 
import ban 


Rush of protests against 
proposed Australian bill. 


BY BIANCA NOGRADY 


bill calling for a ban on the import 
A: non-human primates for medical 

research in Australia almost slipped 
under scientists’ radar — but researchers have 
now rushed to argue against the proposal. 

A Senate committee is considering the bill 
and will report in early March, before a possi- 
ble debate and vote by the full Senate. That it is 
now publicly opposed by several scientific insti- 
tutions worldwide is largely down to the efforts 
of neuroscientist Nicholas Price, who says that 
he was shocked to hear about it in late January. 

Researchers had missed the legislative 
proposal when it was introduced in September 
2015 as an amendment to Australia’s federal 
Environment Protection and Biodiversity 
Conservation Act. This was because the 
committee that deals with this legislation is 
not usually of interest to those in the medical- 
research community, says Price, who uses 
marmosets and macaques in experiments at 
Monash University’s Biomedicine Discov- 
ery Institute in Melbourne. By the time he 
heard about the proposed ban, from another 
researcher, the window for public comment was 
days away from closing (although it was later 
extended). Price and his Monash colleagues 
James Bourne and Marcello Rosa began e-mail- 
ing researchers around the world, and institu- 
tions rushed to submit statements of opposition. 

Senator Lee Rhiannon, a member of the 
Greens party who trained as a zoologist and 
introduced the proposal, told Nature that her 
party is not calling for a ban on non-human 
primates in research. The bill is a “modest” way 
to improve the welfare of research animals, 
she says. But Price and Bourne say that cut- 
ting off access to the genetic diversity required 
to maintain Australia’s three main breeding 
colonies would eventually lead to inbreeding, 
health problems and the end of the country’s 
research on non-human primates. 

Australia’s Parliament may be dissolved 
for a general election before the bill makes it 
to a vote — as happened to a similar bill that 
Rhiannon introduced in 2012. Price says that 
the events have convinced him ofa need to be 
more public about the importance of his work. 
“We feel that the majority of the public would 
be very supportive,” he says. = 
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THOROUGHLY MODERN MILGRAM 


A modern version of Stanley Milgram’s experiments on obedience to 
authority avoids the ethical pitfalls of the classic 1960s studies. 


Volunteers knowingly inflict real pain: a 
cash fine and an electric shock, or just a 
fine. They take turns to be ‘victim’. 


Feelings probed using neural 

readings or volunteers’ sense 
of time lag between key being 
pressed and a tone sounding. 


PSYCHOLOGY 


How the 
reacts to 


Experimenter tells one 
volunteer which key to press 
— or turns away and offers a 
free choice of key press. 


Keyboard: one key 
inflicts harm, one 
doesn’t. 


Shock calibrated to pain 
tolerance of victim. 


= 


brain 
orders 


Modern spin on iconic Milgram experiments suggests that 
people obeying commands feel less responsible for actions. 


BY ALISON ABBOTT 


ore than 50 years after a controver- 
M sial psychologist shocked the world 
with studies that revealed people's 
willingness to harm others on order, a team of 
cognitive scientists has carried out an updated 
version of these iconic ‘Milgram experiments. 

The findings may offer some explanation 
for Stanley Milgram’s uncomfortable revela- 
tions. When following commands, the team 
says, people genuinely feel less responsibility 
for their actions — whether they are told to 
do something evil or benign. 

“Tf others can replicate this, then it is giving 
us a big message,” says neuroethicist Walter 
Sinnot-Armstrong of Duke University in Dur- 
ham, North Carolina, who was not involved 
in the work. “It may be the beginning of an 
insight into why people can harm others if 
coerced: they don’t see it as their own action.” 

The study may feed into a long-running 
legal debate about the balance of personal 
responsibility between someone acting 
under instruction and their instructor, says 
Patrick Haggard, a cognitive neuroscientist at 


2016 
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University College London, who led the work, 
published on 18 February (E. A. Caspar et al. 
Curr. Biol. http://doi.org/bcnj; 2016). 

Milgram’s work in the 1960s was motivated 
by the trial of Adolf Eichmann, a Nazi who 
argued that he was ‘just following orders’ when 
he sent Jews to their deaths. The latest find- 
ings dont legitimize harmful actions, Haggard 
emphasizes, but do suggest that the excuse of 
‘obeying orders’ betrays a truth about howa 
person feels when acting under command. 

In his experiments, Milgram told partici- 
pants that a man was being trained to learn 
word pairs in a neighbouring room. The 
participants had to press a button to deliver 
an electric shock of escalating strength to the 
learner when he made an error; when they 
did so, they heard his cries of pain. In reality, 
the learner was an actor, and no shock was 
ever delivered. Milgram’s aim was to see how 
far people would go when they were ordered 
to step up the voltage. 

Routinely, an alarming two-thirds of par- 
ticipants continued to step up shocks, even 
after the learner was apparently rendered 
unconscious. But Milgram did not assess his 


ADAPTED FROM E. A. CASPAR ET AL. (2016) 


participants’ subjective feelings as they were 
coerced into doing something unpleasant. 
And his experiments have been criticized for 
the deception that they involved — not just 
because participants may have been trauma- 
tized, but also because some may have guessed. 
that the pain wasn’t real. 

Modern teams have conducted partial and 
less ethically complicated replications of Mil- 
gram’s work. But Haggard and his colleagues 
wanted to find out what participants were feel- 
ing. They designed a study in which volunteers 
knowingly inflicted real pain on each other, and 
were completely aware of the experiment’s aims. 

Because Milgram’s experiments were so con- 
troversial, Haggard says that he took “quite a 
deep breath before deciding to do the study”. 
But he says that the question of who bears 
personal responsibility is so important that he 
thought it was “worth trying to do some good 
experiments to get to the heart of the matter.” 

In his experiments, the volunteers (all were 
female, as were the experimenters, to avoid gen- 
der effects) were given £20 (US$29). In pairs, 
they sat facing each other across a table, with 
a keyboard between them (see “Thoroughly 
modern Milgram). A participant designated 
the ‘agent’ could press one of two keys; one did 
nothing. But for some pairs, the other key would 
transfer 5p to the agent from the other partici- 
pant, the ‘victim; for others, the key would also 


deliver an electric shock to the victim’s arm. 
(The shock was calibrated to the participant, so 
that it was painful but bearable.) In one experi- 
ment, an experimenter stood next to the agent 
and told her which key to press. In another, the 
experimenter looked away and gave the agent a 
free choice about which key to press. 

Whichever key was chosen, a tone sounded 
after a few hundred milliseconds, and both 
volunteers were asked to judge how long this 
took. Psychologists have established that people 
perceive the interval between an action and its 
outcome as shorter when they act of their own 
free will, for example to move their arm, than 
when the action is passive — having their arm 
moved by someone else. As a result, the time 
that the participant thinks has elapsed between 
the key press and the tone acts as a measure of 
their sense of being responsible for their actions. 

When they were ordered to press a key, the 
participants perceived the time to the tone as 
longer than when they had free choice — as if 
their action had been passive. 

In a separate experiment, volunteers fol- 
lowed similar protocols while electrodes on 
their heads recorded their neural activity 
through EEG (electroencephalography). When 
ordered to press a key, their EEG recordings 
were quieter — suggesting, says Haggard, that 
their brains were not processing the outcome of 
their action. Some participants later reported 
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feeling reduced responsibility for their action. 

Unexpectedly, giving the order to press the 
key was enough to cause the effects, even when 
the keystroke led to no physical or financial 
harm. “Tt seems like your sense of responsibil- 
ity is reduced whenever someone orders you to 
do something — whatever it is they are telling 
you to do,” says Haggard. 

The study might inform legal debate, but it 
also has wider relevance to other domains of 
society, says Sinnot-Armstrong. For example, 
companies that want to create — or avoid — a 
feeling of personal responsibility among their 
employees could take its lessons on board. m 


CORRECTIONS 

The News story ‘Scientists probe Zika link to 
birth defects’ (Nature 530, 142-143; 2016) 
wrongly stated that at least seven countries 
have shown abnormal rates of microcephaly; 
this has happened only in Brazil. In addition, 
the article implied that 404 confirmed cases 
of microcepahly in Brazil could be linked 

to Zika virus — only 17 of those 404 have 
been linked so far. Finally, the correction 
note for the Editorial ‘Blue future’ (Nature 
529, 255-256; 2016) omitted to say that 
the carbon sequestration in question was by 
living organisms in the oceans. 
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ILLUSTRATION BY ANNA PARINI 


rUTURE 
GENERATIONS 


he effects on distant tomorrows of the 
decisions we make today have never 
been greater. As we change our planet, 
ourselves and, potentially, our descend- 
ants, in ever more dramatic ways, this 
issue of Nature takes stock: do we have the 
brains and the tools to understand and account 
for the future and, if not, what should be done? 

Technology experts foresee a world just a few 
decades away that is so radically different from 
today that it is hard to comprehend. The expo- 
nential rate of progress in a suite of enabling 
technologies, ranging from computer-process- 
ing power to communication, could drive dras- 
tic changes in artificial intelligence, robotics, 
molecular biology and more (see page 398). 

Some think that the people who inhabit 
this world might also be irrevocably altered, 
for the first time, by genetic engineering. The 
arrival of the powerful genome-editing tech- 
nology CRISPR-Cas9 might prevent children 
from being born with some deadly disorders 
or disabilities, and a feature on page 402 dis- 
cusses the extent to which this is possible 
and desirable. 

Forecasting is hard and fraught with bias. 
For example, as Nicholas Stern warns on page 
407, current models of climate economics 
implicitly assume that lives in the future are 


less important than those today — a value 
judgement that is rarely scrutinized and diffi- 
cult to defend. And, as Celine Kermisch writes 
on page 383, near and remote future genera- 
tions have very different needs. Hundreds of 
social-science studies highlight the tensions 
between our tendencies to care about the 
well-being of others yet to favour current ben- 
efits over future ones. Therefore, on page 413, 
behavioural economists Helga Fehr-Duda and 
Ernst Fehr call for the design of sustainable- 
development policies and schemes that exploit 
these evolved behaviours. 

Finally, John Bongaarts expresses the view on 
page 409 that the best thing we could do now 
for future generations is to ensure that there 
are fewer of them, by doubling the aid spent on 
family planning. 

The only certainties are that tomorrow's 
world is difficult to predict, is heading straight 
for us, and that billions more people will inhabit 
it. How we account for future impacts in today’s 
decisions should preoccupy researchers and 
policymakers more than it does now. m 


FUTURE GENERATIONS 


A Nature special issue 
nature.com/futuregenerations 
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ILLUSTRATION BY GREYGOUAR 


n March 2001, futurist Ray Kurzweil published an essay 
arguing that humans found it hard to comprehend their own 
future. It was clear from history, he argued, that technological 
change is exponential — even though most of us are unable to 
see it — and that ina few decades, the world would be unrecog- 
nizably different. “We wont experience 100 years of progress in 
the 21st century — it will be more like 20,000 years of progress 
(at today’s rate),” he wrote, in “The Law of Accelerating Returns. 
Fifteen years on, Kurzweil is a director of engineering at Google 
and his essay has acquired a cult following among 
futurists. Some of its predictions are outlandish or 
overhyped — but technology experts say that its basic 
tenets often hold. The evidence, they say, lies in the 
exponential advances in a suite of enabling technolo- 
gies ranging from computing power to data storage, 
to the scale and performance of the Internet (see 
‘Onwards and upwards’). These advances are creat- 
ing tipping points — moments at which technologies 
such as robotics, artificial intelligence (AJ), biology, 
nanotechnology and 3D printing cross a threshold 
and trigger sudden and significant change. “We live in 
amind-blowingly different world than our grandpar- 
ents,” says Fei-Fei Li, head of the Stanford Artificial 
Intelligence Laboratory in California, and this will be 
all the more true for our children and grandchildren. 
Kurzweil and others have argued that people find this pace of change 
almost impossible to grasp, because it is human nature to perceive rates 
of progress as linear, not exponential — much as when one zooms in on 
a small part ofa circle and it appears as an almost straight line. People 
tend to focus on the past few years, but pulling back reveals a much more 
dramatic change. Many things that society now takes for granted would 
have seemed like futuristic nonsense just a few decades ago. We can 
search across billions of pages, images and videos on the web; mobile 
phones have become ubiquitous; billions of connected smart sensors 
monitor in real time everything from the state of the planet to our heart- 
beats, sleep and steps; and drones and satellites the size of shoeboxes 
roam the skies. 

If the pace of change is exponentially speeding up, all those advances 
could begin to look trivial within a few years. Take ‘deep learning; a 
form of artificial intelligence that uses powerful microprocessor chips 
and algorithms to simulate neural networks that train and learn through 
experience, using massive data sets. Last month, the Google-owned 
Al company DeepMind used deep learning to enable a computer to 
beat for the first time a human professional at the game of Go, long 
considered one of the grand challenges of AI. Researchers told Nature 
that they foresee a future just 20 years from now — or even sooner 
— in which robots with AI are as common as cars or phones and are 
integrated into families, offices and factories. The “disruptive expo- 


WE LIVE IN A 
MIND-BLOWINGLY 
DIFFERENT 
WORLD THAN OUR 
GRANDPARENTS. 


at the Massachusetts Institute of Technology in Cambridge. 

After decades in development, applications of AI are moving into the 
real world, says Li, with the arrival of self-driving cars, virtual reality and 
more. Progress in AI and robotics is likely to accelerate rapidly as deep- 
pocketed companies such as Google, Apple, Facebook and Microsoft 
pour billions of dollars into these fields. Gill Pratt, former head of the 
US Defense Advanced Research Projects Agency’s Robotics Challenge, 
asked last year whether robotics is about to undergo a ‘Cambrian explo- 
siom — a period of rapid machine diversification (G. A. Pratt J. Econ. 
Perspect. 29, 51-60; 2015). Although a single robot 
cannot yet match the learning ability of a toddler, 
Pratt pointed out that robots have one huge advan- 
tage: humans can communicate with each other at 
only 10 bits per second — whereas robots can com- 
municate through the Internet at speeds 100 million 
times faster. This could, he said, result in multitides 
of robots building on each other’s learning experi- 
ences at lightning speed. Pratt was hired last Sep- 
tember to head the Toyota Research Institute, a new 
US$1-billion AI and robotics research venture head- 

uartered in Palo Alto, California. 

Many researchers say that it is important to pre- 
pare for this new world. “We need to become much 
more responsible in terms of designing and operating 
these robots as they become more powerful,” says Li. 
In January 2015, a group including Elon Musk, Bill Gates and Stephen 
Hawking penned an open letter calling for extensive research to maxi- 
mize the benefits of AI and avoid its potential pitfalls. The letter has now 
been signed by more than 8,000 people. 

Yet predicting the future can be a fool’s game — and not everyone 
is convinced that technological change will hit humanity quite so fast. 
Ken Goldberg, an engineer at the University of California, Berkeley, 
questions the idea that technologies advance exponentially across the 
board, or that those that do will continue indefinitely. “The danger of 
overly optimistic exuberance is that it could set unrealistic expecta- 
tions and trigger the next AI winter,’ he says, alluding to periods in AI’s 
history where hype gave way to disappointment followed by steep cuts 
in funding. Goldberg says that recent warnings that AI and robots risk 
surpassing human intelligence are “greatly exaggerated”. 

And Stuart Russell, a computer scientist at the University of 
California, Berkeley, questions the notion that exponential advances in 
technology necessarily lead to transformative leaps. “If we had comput- 
ers a trillion times faster we wouldn't have human-level AI; half in jest, 
one might say we'd just get wrong answers a trillion times sooner,’ he 
says. “What matters are real conceptual and algorithmic breakthroughs, 
which are very hard to predict.” 

Russell did sign the Hawking letter — and says it is important not 
to ignore the ways that technologies could be taken in potentially 


nentials” of technological change will create “a 
world where everybody can have a robot and 
robots are pervasively integrated in the fabric 
of life’, says Daniela Rus, head of the Computer 
Science and Artificial Intelligence Laboratory 


FUTURE GENERATIONS 


A Nature special issue 
nature.com/futuregenerations 


harmful directions with profound results. “We 
made this mistake with fossil-fuel technologies 
100 years ago — nowit’s probably too late.” m 


Declan Butler is a senior reporter for Nature. 
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omorrow’s children 


What would genome editing really mean for future generations? 


uthie Weiss’s basketball team 
seemed to be minutes away 
from its fourth straight loss. But even as she 
stood on the sidelines for a brief rest, the nine- 
year-old had not given up. She convinced the 
coach to put her back in the game. Then, she charged out 
onto the court, caught a pass from a teammate and drove 
straight to the basket. Swish! Ruthie scored a quick two 
points, putting her team in the lead. As the game clock 
wound down, she scored again, clinching the victory. The 
team had earned its first win of the season, and celebrated 
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as if it had just taken the national 
championship. A couple of parents from 
the opposing team even stopped by to congratulate Ruthie, 
who had scored all of her team’s 13 points: “Wow, she’s 
unbelievable!” they told her mum and dad. 

What makes Ruthie’s performance even more 
extraordinary is her DNA. Because of a misspelling in one 
of her genes, she has albinism: her body produces very lit- 
tle of the pigment melanin, which means that her skin and 
hair are fair, and that she is legally blind. Her visual acuity 
is ten times worse than average. She is still learning to read 
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PRESTON GANNAWAY 


Ruthie Weiss walks home from playing basketball with her father (centre) 
and uncle. Her sight problems have not prevented her from excelling at sport. 


and will probably never be able to drive a car, but she can make out the 
basket and her teammates well enough to shoot, pass and play. 

In January, Ruthie’s dad Ethan asked her whether she wished that her 
parents had corrected the gene responsible for her blindness before she 
was born. Ruthie didn't hesitate before answering — no. Would she ever 
consider editing the genes of her own future children to help them to 
see? Again, Ruthie didn't blink — no. 

The answer made Ethan Weiss, a physician-scientist at the University 
of California, San Francisco, think. Weiss is well aware of the rapid 
developments in gene-editing technologies — techniques that could, 
theoretically, prevent children from being born with deadly disorders 
or with disabilities such as Ruthie’s. And he believes that if he had had 
the option to edit blindness out of Ruthie’s genes before she was born, 
he and his wife would have jumped at the chance. But now he thinks 
that would have been a mistake: doing so might have erased some of the 
things that make Ruthie special — her determination, for instance. Last 
season, when Ruthie had been the worst player 
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have made us and her different in a way that we would have regretted”, 
he says. “That’s scary.’ 

Ethan and Ruthie are not the only people pondering these kinds 
of questions. The emergence of a powerful gene-editing technology, 
known as CRISPR-Cas9, has elicited furious debate about whether and 
how it might be used to modify the genomes of human embryos. The 
changes to their genomes would almost certainly be passed down to 
subsequent generations, breaching an ethical line that has typically been 
considered uncrossable. 

But emerging technologies are already testing the margins of what 
people deem acceptable. Parents today have unprecedented control 
over what they pass on to their children: they can use prenatal genetic 
screening to check for conditions such as Down's syndrome, and choose 
whether or not to carry a fetus to term. Preimplantation genetic diagno- 
sis allows couples undergoing in vitro fertilization to select embryos that 
do not have certain disease-causing mutations. Even altering the herit- 
able genome — as might be done if CRISPR were used to edit embryos 
— is acceptable to some. Mitochondrial replacement therapy, which 
replaces a very small number of genes that a mother passes on with 
those from a donor, was approved last year in the United Kingdom for 
people who are at risk of certain genetic disorders. 

Many safety, technical and legal barriers still stand in the way of 
editing DNA in human embryos. But some scientists and ethicists say 
that it is important to think through the implications of embryo editing 
now — before these practical hurdles are overcome. What sort of world 
would these procedures create for those currently living with disease 
and for future generations? 

So far, little has been heard from the people who could be first affected 
by the technology — but speaking with these communities reveals a 
diverse set of views. Some are impatient, and say that there is a duty to 
use genome editing quickly to eliminate serious, potentially fatal condi- 
tions. Some doubt that society will embrace it to the degree that many 
have feared, or hoped. Above all, people such as Ethan Weiss caution 
that if policymakers do not consult people with disabilities and their 
families, the technology could be used unthinkingly, in ways that harm 
patients and society, today and in the future. 

“Hearing the voices of people who live with these conditions is 
really important;’ says Tom Shakespeare, a medical sociologist at the 
University of East Anglia in Norwich, UK. 


THE CASES FOR 

John Sabine, now 60, was once described as one of the brightest legal 
minds of his generation in England. Now, he is in the advanced stages of 
Huntington's disease: he cannot walk or talk, is incontinent and requires 
constant care. Charles Sabine, his younger brother, carries the same 
genetic glitch that causes Huntington's disease, and therefore knows 
that, like his brother and his father before him, he is destined to undergo 
the same deterioration of brain and body. 

Charles and his brother have five children between them, each of 
whom as a 50% chance of having inherited the mutation that causes 
Huntington's disease. To Charles — and to many others who live with 
the mutation that causes Huntington's — there is no legitimate ethical 
argument about whether gene editing should be used, either to treat 
people living with the condition now or to spare their children from it. 

“Anyone who has to actually face the reality of one of these diseases is 
not going to have a remote compunction about thinking that there is any 
moral issue at all,” Sabine says. “If there was a room somewhere where 
someone said, ‘Look, you can go in there and have your DNA changed; 
I would be there breaking the door down” 

Matt Wilsey, a technology entrepreneur in San Francisco, would be 
there too. His daughter Grace was one of the first people in the world 
to be diagnosed with a disease caused by a 


on her basketball team, she had decided on her 
own to improve, and unbeknownst to her par- 
ents had been practising at every opportunity. 
Changing her disability, he suspects, “would 


FUTURE GENERATIONS 


A Nature special issue 
nature.com/futuregenerations 


mutation in the gene NGLY1, which makes it 
difficult for her cells to get rid of misshapen 
proteins. Grace, now six years old, has severe 
movement and developmental disabilities. 
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She can barely walk and cannot talk. Because her condition is new to 
medicine, doctors cannot even predict how long she might live. 

Wilsey is bullish on CRISPR. He says that if he had had the chance to 
detect and fix the mutation in Grace’s genome before she was born, he 
would have. But he is frustrated that the debate over editing embryos 
seems to have monopolized discussions about the technology. He is 
hopeful that a gene-therapy-like approach using CRISPR, which would 
be free of the ethical concerns about altering the genes she passes on, 
could help Grace within several years. And he wonders whether a tem- 
porary moratorium on embryo editing might allow the field to focus 
on such approaches sooner. 

“As a parent with an incredibly sick child, what are we supposed 
to do — sit by on the sidelines while my child dies? There's zero 
chance of that,” Wilsey says. “CRISPR is a bullet train that has left the 
station — there’s no stopping it, so how can we harness it for good?” 

A meeting convened in December 2015 by the US national 
academies of sciences and medicine, the Chinese Academy of Sciences 
and the Royal Society of London recommended such a moratorium in 
light of multiple safety and ethical concerns. Still, many bioethicists 
and scientists have argued that if defects in single genes causing fatal 
and debilitating conditions could be corrected 
in an embryo, then they should be. Shake- 
speare notes that embryo editing for condi- 
tions that cause major disability and death are 
likely to raise less concern and criticism in the 


“These kinds of 
interactions significantly 


everyone. The experience of disability, she adds, is universal; all people 
inevitably experience sickness, accidents and age-related decline. “At 
our peril, we are right now trying to decide what ways of being in the 
world ought to be eliminated,’ she says. 

Padden says that ethicists, patients and disabilities-studies researchers 
must work urgently to make a broad societal case in favour of greater 
acceptance of diversity. This has been a long-fought battle, and many see 
evidence of progress — for instance, in the ‘neurodiversity’ movement, 
which champions the idea that medical conditions such as autism are 
part of the spectrum of human variation. “We do have to start coming 
up with more arguments for diversity, and quickly, because CRISPR is 
coming upon us faster than some of us are thinking about this issue,” 
she says. 


MAKING CHOICES 
The prospect of editing the genome of a human embryo is still in its 
early stages, but the ability to prevent the inheritance of some conditions 
already exists. Prenatal screening, which has advanced to the point that 
doctors can sample a developing fetus’s DNA through its mother’s blood, 
has given parents the option to terminate pregnancies when a disease or 
disability is diagnosed. This has already started 
to show limited effects on the population. 

In Europe, for example, the prevalence of a 
Down's syndrome diagnosis during pregnancy 
has risen from 20 cases per 10,000 in 1990 to 


long term. But, he says: “As soon as you get change our attitudes 23 cases per 10,000 today, as the average age of 
away from the archetypal terrible condition, H women having babies has increased. But the 
then you've got a debate about whether a con- about what kinds of number of children born with the syndrome 
dition makes life unbearably hard.” people matte r in the has stayed level at about 11 per 10,000, because 

a many women whose fetuses are diagnosed with 
SOCIAL CONSEQUENCES world. the condition terminate their pregnancies. 


Many people are concerned about where that 

line would be drawn. Although it may seem now that only a few, very 
severe conditions should be subject to gene editing, disability activists 
point out that the list of conditions considered as illnesses, and possibly 
subject to medical treatment, is expanding. “More and more, people 
think of obesity or predisposition to alcoholism as disease,’ says Carol 
Padden, a linguist at the University of California, San Diego. She herself 
is deaf, and points out that many deaf people do not consider it a disabil- 
ity. This stance has led to controversy when, for instance, deaf parents 
decline technologies such as cochlear implants for their children, or 
even go so far as to select, through processes such as preimplantation 
genetic diagnosis, children who will be deaf. 

Like Padden, some disability-studies researchers do not oppose the 
idea of gene editing, but do think that society needs to understand that 
itis not possible to eliminate all disability, and that humans might lose 
something important if they try to do so. 

Padden points out that accommodations originally intended for 
people with disabilities often end up benefiting everyone. For example, 
the development of closed captioning — subtitles for the hearing- 
impaired on television — required major advocacy from the deaf com- 
munity and legislative action to get off the ground in the United States 
in the 1970s. Today, people rely on it in ways that no one could have 
foreseen, such as in noisy airports and sports bars. Some people use it 
to learn to read or to learna language. 

Rosemarie Garland-Thomson, a literature scholar and co-director of 
the Disability Studies Initiative at Emory University in Atlanta, Geor- 
gia, adds that legislative mandates, such as the 1990 Americans with 
Disabilities Act in the United States, have helped to integrate people 
with disabilities into society — in workplaces, schools and other public 
spaces. Asa result, the world is much more humane for everyone, says 
Garland-Thomson. “These kinds of interactions significantly change 
our attitudes about what kinds of people matter in the world” 

The idea that parents should edit out characteristics that are 
considered debilitating goes against this drive towards inclusion, 
Garland-Thomson warns, and could create a harsher social climate for 
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In the United States, pregnancies in which a 
Down's syndrome diagnosis is made are terminated in 67-85% of cases. 

By surveying women whose fetuses and babies are diagnosed with 
Down's syndrome, and by compiling similar surveys from around the 
world, medical geneticist Brian Skotko of the Massachusetts General 
Hospital in Boston has found that doctors sometimes advise women 
to terminate or give up for adoption babies diagnosed prenatally with 
Down's syndrome. They can influence the decision by using phrases 
such as “Tm sorry’, or “I have some bad news to share”. For instance, 
34% of 71 Dutch women who terminated their pregnancy after a 
Down's syndrome diagnosis said that their doctors did not even men- 
tion the possibility of carrying the pregnancy to term when discussing 
their options’. 

Mark Leach, a lawyer in Louisville, Kentucky, whose 11-year-old 
daughter has Down's syndrome, says that he and his wife have been 
asked many times — especially when his wife was pregnant with their 
second child — whether they “knew beforehand” that Juliet would be 
born with Downs. (They didn’t.) Some people are simply curious, Leach 
says, but for others, there’s judgement in that question. “The ability to 
do something beforehand imposes a sense of, “You should do not only 
what's right for you, but what's right for society,” Leach says. It bothers 
him, he says, that although government and private health insurers 
routinely pay for prenatal diagnosis, he recently learned that his school 
system is discontinuing support for the learning specialist who had been 
helping Juliet to thrive in mathematics and reading. 

Dorothy Roberts, a professor of law and sociology at the University of 
Pennsylvania in Philadelphia, says that this kind of pressure is troubling 
and that it could get worse if embryo editing were to become readily 
available. “Women should not be given the responsibility of ensuring 
the genetic fitness of their children based on lack of support for children 
with disabilities” 

Leach knows that children with disabilities can live rich lives. Juliet 
likes ballet dancing and horse-riding, and she is especially attuned to 
the names of people and animals whom she knows. And Leach says 
that she helps to remind other people how to care for others. “The main 
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thing that would be lost if Down’s syndrome continues to diminish is 
a diminishment in the amount of compassion that is shown in this 
world,” he says. 

Even among people who already have life-threatening conditions, 
many choose not to interfere with the way the genetic cards are dealt. 
Edward Wild, a neurologist who cares for people with Huntington’s 
disease at University College London, estimates that fewer than 5% of 
patients in the United Kingdom use preimplantation genetic screening 
to select embryos that lack the disease-causing mutation and so avoid 
passing it to their children. Some people do not know that they have the 
mutation; some decide against screening because of the costs or risks 
involved; some have personal or moral objections to the technique; 
and some just have a sense that 50-50 odds of passing down the disease 
are not so bad. 

“Having kids the fun way is still much more popular than having kids 
the science way, even though the latter is how you guarantee the kid is 
free of Huntington's disease,” Wild says. 

Even if gene editing were safe, effective and everyone opted to use it, 
it would not eliminate genetic diseases, because researchers still have 
a long way to go to understand the genes involved. Even Huntington's, 
which is fairly well characterized, is no easy 
target. The glitch that causes it is a repeat ofa 
particular genetic sequence; the more repeats, 
the more severe the symptoms, and repeats are 
added with each successive generation. New 
families are diagnosed with Huntington’ all 
the time, either because the disease is misdiag- 
nosed in older generations or because symp- 
toms worsen, and become recognizable, in 
subsequent generations. Although he is work- 
ing on genetic techniques to treat Huntingon's, 
Wild doesn’t hold out high hopes for a future 
free of the disease. “Although it’s nice to think 
about, it’s little more than a dream,” he says. 

Human biology can complicate things in other ways. Padden notes, 
for instance, that some mutations that predispose to genetic disease, 
such as the sickle-cell mutation, confer population-level benefits, such 
as resistance to malaria. So editing out one disease could backfire by 
increasing the risk of another. She argues that very little is known 
about the potential benefits of other mutations associated with dis- 
ease, and applying genome editing too freely could have unintended 
consequences. 

And if it were adopted, the technology would almost certainly be 
applied unevenly around the world. Aleksa Owen, a sociologist at the 
University of Illinois at Chicago, predicts that genome editing would be 
used first in countries that approve of and support assisted reproductive 
technologies, such as the United Kingdom, some other European Union 
countries, China and Israel. But it would probably be too expensive for 
many people in developing countries. 


UNEVEN ACCESS 

Still, some scientists predict that editing human embryos could 
have transformative effects. During the National Academies’ sum- 
mit on gene editing in December, Harvard University geneticist Dan 
MacArthur tweeted, “Prediction: my grandchildren will be embryo- 
screened, germline-edited. Wont ‘change what it means to be human. 
It'll be like vaccination.” 

Sandy Sufian, a historian of medicine and disability at the University 
of Illinois, agrees with MacArthur that CRISPR has the potential to 
become widely adopted, both because of the perception that it would 
save money that would otherwise be spent caring for disabled people 
and because of people's fear of disability. But she questions the idea that 
eliminating such conditions will necessarily improve human life. Sufian 
has cystic fibrosis, a disease caused by mutations that render her lung 
cells more vulnerable to infection and disease. She spends 40 hours a 
week inhaling medicine to clear her lungs of mucus, exercising and 


“Prediction: my 
grandchildren will be 
embryo-screened, 
germline-edited. Won’t 
‘change what it means to 
be human’. It'll be like 
vaccination.” 
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undergoing physical therapy; others have to quit their jobs to make 
sufficient time for treatments. Yet given the option to edit cystic fibrosis 
out of her bloodline, Sufian wouldn't do it. “There are some great things 
that come from having a genetic illness,” she says. 

Garland-Thomson echoes that sentiment; she has one and a half arms 
and six fingers because of a condition called limb-reduction disorder. 
She says that she values traits in herself that she may have developed as 
adaptations to the condition: she is very sociable and wonders if that is 
because she’s had to learn to work hard to make others feel comfortable 
around her. “Any kinds of restrictions or limitations have created the 
opportunity for me to develop work-arounds,’ Garland-Thomson says. 

Shakespeare, who has achondroplasia, a genetic condition that causes 
shorter than average stature, says that people with disabilities are just 
as able to attain life satisfaction as others. “I have achieved everything 
I hoped for in life, despite having restricted growth: career, children, 
friendship and love.” He wouldn't want to have altered his own genes 
to be taller, he says. 


DISABILITY RIGHTS 

People without disabilities consistently underestimate the life 
satisfaction of those with them. Although 
people with disabilities report a slightly lower 
overall quality of life than those without, the 
difference is small. One study’ found that half 
of people with serious disabilities ranked their 
quality of life as ‘good’ or ‘excellent. 

People also overestimate how severely 
health affects their happiness compared with 
other factors, such as economic or social sup- 
port. One 1978 study’, for instance, compared 
people who had recently become paralysed 
as a result of accidents with people who had 
recently won between US$50,000 and $1 mil- 
lion ina state lottery. Although people who had 
had accidents ranked their happiness lower than lottery winners, both 
groups predicted that their future happiness would be roughly equal, 
and people who had accidents derived more pleasure from everyday 
activities, such as eating breakfast or talking to a friend. 

‘A lot of this terrific science and technology has to take into account 
that the assumption of what life is like for people who are different is based 
on prejudice against disability,’ says Lennard Davis, a disability-studies 
researcher at the University of Illinois, who was raised by two deaf parents. 

There is a common saying among people in the disability-rights 
community: “Nothing about us without us.” People with disabilities 
argue that scientists, policymakers and bioethicists should take steps to 
ensure that the CRISPR debate reflects what is best for patients and their 
families, to ensure its most humane use now and for future generations. 

At a minimum, they say, the investment in developing CRISPR 
should be matched by investments in innovations to help people who 
are already living with conditions that cause disability. And it is essential 
that people with the conditions that are up for consideration as possible 
CRISPR targets should be included in the decision-making processes. 

For their part, Ruthie Weiss and her dad have already made up their 
minds. Ruthie must work harder than her classmates to do some routine 
activities. But when she is dominating the basketball court, or practis- 
ing the piano, or skiing down a mountain, Ethan Weiss doesn't see a 
child with a disability. He sees his daughter making the most of her life, 
given all her strengths and challenges. And he knows that he wouldn't 
change a thing. = 


Erika Check Hayden writes for Nature from San Francisco. 
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Sathkira District, Bangladesh, still flooded a year after 2009’s Cyclone Aila. 


Current climate models are 
orossly misleading 


Nicholas Stern calls on scientists, engineers and economists to help policymakers by 
better modelling the immense risks to future generations, and the potential for action. 


Te twin defining challenges of our 
century are overcoming poverty and 
managing climate change. If we can 
tackle these issues together, we will create 
a secure and prosperous world for genera- 
tions to come. If we don't, the future is at 
grave risk. 

Researchers across a range of disciplines 
must work together to help decision-makers 
in the public, private and non-profit sectors 
to rise to these challenges. Economists, in 
particular, need more help from scientists 


and engineers to devise models that provide 
better guidance about what will happen if we 
succeed or if we fail. 

As the 2015 Paris agreement on climate 
change made clear, we must achieve a 
net-zero carbon economy this century. 
Doing so will require policies that drive 
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innovation, investment and entrepreneur- 
ship. The political will to make the neces- 
sary decisions depends partly on improving 
the analysis and estimates of the economics 
of climate change. Then the consequences 
of unmanaged global warming can be 
weighed much more transparently against 
the investments and innovations necessary 
to mitigate it. 

Current economic models tend to under- 
estimate seriously both the potential impacts 
of dangerous climate change and the 
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> wider benefits of a transition to low- 
carbon growth. There is an urgent need for 
anew generation of models that give a more 
accurate picture. 


DARK IMPACTS 

The Fifth Assessment Report of the Inter- 
governmental Panel on Climate Change 
(IPCC), published in 2013 and 2014, pro- 
vided a comprehensive overview of the 
literature on the costs of action and inaction. 
But the assessment understated the limita- 
tions of the research done so far. Essentially, 
it reported on a body of literature that had 
systematically and grossly underestimated 
the risks of unmanaged climate change. 
Furthermore, that literature had failed to 
capture the learning processes and econo- 
mies of scale involved in radical structural 
and technical change, and the benefits of 
reducing fossil-fuel pollution, protecting 
biodiversity and forests, and so on. 

The IPCC pointed out’ that estimates of 
losses resulting from a 2°C increase in mean 
global temperature above pre-industrial 
levels ranged from 0.2% to 2% of global 
gross domestic product. It admitted that 
the global economic impacts are “difficult 
to estimate” and that attempts depend ona 
large number of “disputable” assumptions. 
Moreover, many estimates do not account 
for factors such as catastrophic changes and 
tipping points. 

It is these hard-to-predict impacts that 
are the most troubling potential conse- 
quences of inaction. The next IPCC report 
needs to be based on a much more robust 
body of economics literature, which we 
must create now. It could make a crucial 
difference. 

Many estimates of economic losses 
are based on the outputs of integrated 
assessment models (IAMs). These mod- 
els attempt to combine the key elements 
of biophysical and economic systems. 
This is a worthy endeavour. Sadly, most 
IAMs struggle to incorporate the scale of 
the scientific risks, such as the thawing of 
permafrost, release of methane, and other 
potential tipping points. Furthermore, 
many of the largest potential impacts are 
omitted, such as widespread conflict as a 
result of large-scale human migration to 
escape the worst-affected areas. 

For instance, there is evidence that tem- 
perature increases of 1.5°C and 2°C would 
lead to differing extents of sea-level rise and 
extreme weather events’, with obvious impli- 
cations for small island states and coastal 
communities. These differences are simply 
not represented in the flawed estimates of 
economic losses. 

IAMs are also used to calculate the social 
cost of carbon (SCC). They attempt to 
model the incremental change in, or dam- 
age to, global economic output resulting 


from 1 tonne of anthropogenic carbon 
dioxide emissions or equivalent. These 
SCC estimates are used by policymakers in 
cost-benefit analyses of climate-change- 
mitigation policies. 

Because the IAMs omit so many of the 
big risks, SCC estimates are often way too 
low. Asa first step, the consequences being 
assessed should include the damages to 
human well-being and loss of life beyond 
simply reduced economic output. And the 
very large uncertainty, usually involving 
downward bias, in SCC estimates should 
always be made explicit. 

As the IPCC acknowledged’, published 
SCC estimates “lie between a few dollars 
and several hundreds 


of dollars”. These “Discount 
values often depend ratesare 
crucially on the ‘dis- central to any 
counting’ used to discussion of 
translate future costs gouwrhandinthe 


to current dollars. The 
high discount rates 
that predominate 
essentially assume that benefits to people 
in the future are much less important than 
benefits today. 

These discount rates are central to any 
discussion of our hand in the fate of future 
generations. Most current models of climate- 
change impacts make two flawed assump- 
tions: that people will be much wealthier in 
the future and that lives in the future are less 
important than lives now. 

The former assumption ignores the 
great risks of severe damage and disrup- 
tion to livelihoods from climate change. 
The latter assumption is ‘discrimination by 
date of birth’ It is a value judgement that is 
rarely scrutinized, difficult to defend and in 
conflict with most moral codes. 


fate of future 


generations.” 


COSTING TRANSITION 

The other role of IAMs — to estimate the 
costs of climate-change mitigation — also 
suffers from major shortcomings. 

The IPCC’s mitigation assessment’ con- 
cluded from its review of IAM outputs 
that the reduction in emissions needed to 
provide a 66% chance of achieving the 2°C 
goal would cut overall global consump- 
tion by between 2.9% and 11.4% in 2100. 
This was measured relative to a ‘business 
as usual’ scenario. Clearly, growth itself 
can be derailed by climate change from 
business-as-usual emissions. 

So the business-as-usual baseline, against 
which costs of action are measured, conveys 
a profoundly misleading message to policy- 
makers that there is an alternative option 
in which fossil fuels are consumed in ever 
greater quantities without any negative con- 
sequences to growth itself. 

Crucially, IAMs generally omit the 
potentially huge costs of air pollution 
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from fossil fuels — which are saved if 
alternative fuels are used*. IAMs struggle 
to describe developments in alternative 
energy. They fail, in general, to capture the 
feedback loops in the innovation process 
that interact across the economy, prompt- 
ing institutional and behavioural change, 
possible discoveries and economies of scale. 
There is empirical evidence, for example, 
that the geographical location of research- 
ers and inventors can affect whether a firm 
chooses to do clean or dirty innovation. 

The initial investment required to cata- 
lyse the transition to a low-carbon pathway 
might lead to great economic benefits in 
the long run. These could go well beyond 
avoided climate risks’. The knowledge 
spillover from low-carbon innovation into 
the wider economy — for instance, a bat- 
tery developed for electric vehicles being 
used in wheelchairs — seems to be greater 
overall than that from high-carbon-energy 
technologies®. 

As engineers learn how to install, connect 
and repair technology cheaply, unit costs 
fall faster for many new technologies than 
for existing ones. This has already allowed 
solar-photovoltaic and onshore-wind tech- 
nologies to become competitive with natu- 
ral gas and coal in several locations, even 
without emissions taxation. 

Also influential will be the emergence of 
new networks, such as the integration of 
electric-vehicle-energy storage into smart 
grids, as well as rapid technical progress. 
And these steps can be accelerated if, for 
example, consumers change behaviour and 
demand support for resource efficiency, 
recycling and pedestrianization. It is clear 
that much will depend on urban manage- 
ment and design; as cities grow rapidly, 
damaging infrastructure can become 
‘locked in. 


WHAT’S NEEDED? 

There is much that can be done to make 
the assumptions in standard IAMs more 
realistic with respect to the scale and nature 
of damages”*. But to give policymakers the 
reliable information that they need when 
implementing the Paris agreement, incre- 
mental improvements”® to the present gen- 
eration of IAMs may not be enough. 

A comprehensive review of the problems 
of using IAMs in climate economics’ called 
for the research community to develop a 
“third wave” of models. The authors iden- 
tify various types of model that might offer 
advances. Two are: dynamic stochastic 
computable general equilibrium (DSGE) 
models, and agent-based models (ABMs). 

Like current I[AMs, DSGE models can 
explicitly account for uncertainty about the 
future through the introduction of shocks, 
for instance, to economic output, consump- 
tion or climate damages’. ABMs, by contrast, 
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seek to provide more-realistic represen- 
tations of socio-economics by simulating 
the economy through the interactions of 
a large number of different agents, on the 
basis of specific rules. ABMs are widely 
used in finance, but have yet to be seri- 
ously applied to climate change. These are 
promising developments. 

Now, a concerted effort is required by 
the research community to explore as 
many potential avenues as possible to 
better estimate the costs of action and 
inaction on climate change. The IPCC 
should distil what policymakers need to 
inform their decision-making. Learned 
societies and national academies must 
bring together researchers from a wide 
range of relevant disciplines to focus 
attention on improving economic 
modelling quickly. 

Bangladeshi farmers and Cairo city- 
dwellers are at severe risk of flooding 
and storms; southern Europe and parts 
of Africa and the Americas are threat- 
ened by desertification. Perhaps hun- 
dreds of millions of people may need to 
migrate as a result, posing an immense 
risk of conflict. 

There is huge potential in future tech- 
nologies that can drive change. These are 
omitted or badly underestimated in our 
current climate modelling — deeply dam- 
aging our guidance for policymaking. 
The well-being and prosperity of future 
generations are worth more. m 
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> and the world’s ecosystems (see ‘Four 
ways population rise takes its toll’). Many 
countries would be better off with lower 
population growth and birth rates. This is 
exemplified by the east Asian ‘tiger’ econo- 
mies, including South Korea and Taiwan, 
that in recent decades have seen rapid 
increases in per capita incomes as birth rates 
declined. The boost to economic growth that 
follows a decline in fertility is referred to as 
the ‘demographic dividend”. 

So, how can the population juggernaut be 
stopped, or at least slowed? 


WHAT CAN BE DONE 
Since the 1960s, the main policy response to 
rapid population growth, alongside invest- 
ments to improve education and health, 
has been the implementation of voluntary 
family-planning programmes’. To make 
contraception accessible, these programmes 
train staff, subsidize costs and develop dis- 
tribution schemes — for example, clinics, 
drug stores and other delivery systems. 
Equally important, they provide education 
about the health and the socio-economic 
benefits of using contraception and having 
smaller families. The key reason for these 
programmes is to address the substantial 
level of unwanted and unplanned preg- 
nancy, as well as an unsatisfied demand for 
contraception. About 74 million unplanned 
pregnancies occur each year in the develop- 
ing world (39% of annual developing-world 
pregnancies). About half of these end in 
induced abortions’. 

Among the reasons for unwanted and 
unplanned pregnancies are low levels of 
female education, a lack of knowledge about 


WHERE WILL WE BE? 


1 Projected population growth by region 
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and access to contraception, insufficient 
supplies and services, cost and fear of side 
effects. Just as problematic are opposition 
from spouses and other family members 
and traditional gender roles that support 
a desire for large families. To reduce unin- 
tended pregnancies, family-planning pro- 
grammes must go beyond simply providing 
supplies and services; they must also reduce 
or eliminate these other obstacles. 

The most persuasive evidence that such 
programmes work comes from field experi- 
ments, such as the one that began in 1977 in 
the Matlab region of Bangladesh’. A con- 
trol area of Matlab received the same lim- 
ited family-planning 


services as the rest “Inparticular, 
of the country. An education 
experimentalareawas of girlsisa 
provided with free powerful 
services and supplies, brake on 


home visits by well- 
trained female family- 
planning workers, and comprehensive media 
communication. Outreach to husbands, vil- 
lage leaders and religious leaders addressed 
potential social and familial objections. 

The results were clear-cut. Contraceptive 
use jumped from 5% to 33% among married 
women of reproductive age in the experimen- 
tal area. In the control area, little changed. Asa 
result, fertility declined rapidly in the experi- 
mental area. A difference of about 1.5 births 
per woman between the experimental and 
control areas was observed until 1990, and 
a smaller difference continued beyond 
1996, when the experiment ended. Among 
the long-term consequences of this differ- 
ence were the children in the experimental 


fertility.” 


By 2100, our planet is expected to be home to 11.2 billion 


area being educated to higher levels, fami- 
lies having greater household assets, and 
the greater use of preventive health services. 
The experiment demonstrated that family- 
planning programmes can succeed even in 
highly traditional societies. 

Contemporary evidence is fully consistent 
with the Matlab record. For example, coun- 
tries that have suddenly implemented 
comprehensive family-planning pro- 
grammes (such as Iran in 1989 and Rwanda 
in the mid-2000s) have seen rapid subse- 
quent changes in reproductive behaviour. 

Iran’s fertility declined at an extraordinary 
pace from 5.6 births per woman in the late 
1980s to 2.6 a decade later®. In Rwanda, fer- 
tility dropped from 6.1 in 2005 to 4.6 in 2010, 
and the proportion of married women using 
contraception jumped from 17% to 52%’. 
Both countries’ information programmes 
shifted norms by including messages about 
the benefits of smaller families, raising the 
demand for family-planning services. 

Any acceleration of fertility decline 
changes the trajectory of future population 
growth. This impact can be large, as demon- 
strated by a comparison of alternative United 
Nations population projections’ for sub- 
Saharan Africa. With major investments in 
family planning, the fertility trajectory could 
easily be reduced by 0.5 births per woman 
within a few years after the start of the inter- 
vention. This would lead to a population of 
2.8 billion in sub-Saharan Africa by 2100 
(see ‘Where will we be?’). That is around 
1 billion fewer people than in the business- 
as-usual projection. 

Family-planning programmes are most 
effective where socio-economic conditions 
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people — over 50% more than in 2015. 
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are improving. In particular, education of 
girls is a powerful brake on fertility. Educated 
women marry later, tend to want smaller 
families (in part, because the opportunity 
costs of childbearing are higher for these 
women) and are more capable of overcom- 
ing obstacles to their use of family planning. 
However, educated women must have access 
to contraception to act on their reproduc- 
tive preferences. Family planning, education 
(of women and men) and socio-economic 
development are mutually reinforcing, and 
should be pursued together. 


EVOLUTION OF SUPPORT 

Family-planning programmes have been 
controversial, and support for them has 
fluctuated widely over the past half-century’. 
An initial surge of investment was made in 
the 1960s and early 1970s, as the UN and 
donor countries urged developing countries 
to address high fertility and rapid population 
growth. Several such countries accused the 
industrialized nations of making neo-colo- 
nial and imperialistic efforts to keep them 
weak and impoverished. 

The Catholic Church, long opposed to 
artificial birth control and abortion, took 
an increasingly aggressive stance in oppo- 
sition to government-sponsored family 
planning programmes worldwide. And 
when Republicans have occupied the White 
House, US lawmakers have also generally 
been unsupportive of international family- 
planning programmes — reducing funding 
and adding onerous restrictions. 

Nonetheless, contraceptive use rose 
steadily in Asia and Latin America dur- 
ing the 1980s (see go.nature.com/r2oqrl). 
Then, in the 1990s, aid for family planning 
was diverted to fight the AIDS epidemic in 
Africa and elsewhere. Moreover, the fertil- 
ity declines in Asia and Latin America led to 
optimism that population growth in Africa 
would soon slow. 

Political and religious opponents were 
able to draw support for their views from a 
disagreement stirring in the academic litera- 
ture: some economists argued that voluntary 
family-planning programmes are ineffective’. 
Conventional economic theory considers 
parents as people who want to maximize util- 
ity, and, who — within their budget — choose 
the number of children that they have in the 
same way that they might choose how many 
televisions or bicycles to buy. 

An obvious problem with this reasoning 
is that durable goods require an active pur- 
chase, whereas pregnancies occur unless an 
effort is made to avoid them. Furthermore, 
economic theories also typically assume 
that the cost (monetary, social and practi- 
cal) of contraception is sufficiently low so 
as not to influence decision-making. From 
this academic perspective, the occurrence 
of unwanted pregnancies should be as rare 


Over-65s make up more than one-quarter of Japan’s population. 


DEMOGRAPHIC TRENDS 


Bulges, gaps and shifts 


Ageing. The median age of populations is 
rising, especially in the developed world, 
as a consequence of lower birth rates and 
rising lifespans. Europe and Japan now 
host the highest proportion of people aged 
over 65. Further increase in this proportion 
is expected, putting enormous pressure 
on pension and health-care systems, and 
slowing economic growth. 


Migration. The flow of people from poor 
and war-torn countries and continents to 
nearby developed ones (for example, from 
Africa and the Middle East to Europe) is 
rising. The tension this creates will grow as 
populations in poor countries rise, and while 
the economic disparities between sending 
and receiving countries remain large. 


Low birth rate. In most developed countries 
and in a growing number of Asian and 

Latin American nations, women are having 
fewer than two children each — the level 
needed for long-term population stability. 
This causes population ageing and decline 


as people having unwanted new televisions; 
thus family-planning schemes should not be 
given priority. 

Needless to say, that is not how things 
work in the real world. Avoiding unintended 
pregnancies is difficult even for women who 
use relatively effective methods and as a 
result, tens of millions of such pregnancies 
occur each year. Happily, the accumulating 
evidence of plummeting birth rates in a few 


(for example, the populations of Eastern 
Europe and Japan will probably shrink by 
more than one-third by 2100). The near- 
absence of children to provide support will 
make life difficult for the elderly in countries 
where societal safety nets are weak (Such 
as China). 


Missing women. There are 100 million 
fewer women than naturally expected owing 
to sex-selective abortion and greater female 
mortality throughout life (for example, from 
female infanticide). In the past two decades, 
the number of such abortions has risen 

to about 1.5 million per year as a result of 
increased availability of low-cost ultrasound 
and genetic screening (to determine fetus 
sex), and greater access to methods for 
terminating pregnancies in countries that 
have strong cultural preferences for boys. 
These statistics document the widespread 
gender discrimination that still exists in 
many countries. An excess of single men 
may lead to social unrest and trafficking in 
women and girls. 


countries (such as Ethiopia and Rwanda) 
has now largely persuaded policymakers of 
the cost-effectiveness of these programmes’. 
Religious and political opposition persists. 


WHAT NOW? 

To reduce rapid population growth and 
high birth rates, it is essential to start with 
a clear objective: within a decade, women 
everywhere should have access to quality 
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GROWING PAINS 


Four ways population rise takes its toll 


Environmental degradation. Unprecedented 
global threats such as climate change 

and decreasing biodiversity have been 
building and will become more severe as 
populations, economies and consumption 
grow. Crucial local environmental problems 
— including shortages of fresh water and 
arable land, mounting waste, and air, 

water and soil pollution — adversely affect 
health and threaten the expansion of food 
production required to feed more people a 
better diet. 


Economic stagnation. In poor societies, 
populations often double in size in two 

or three decades. Industries, offices, 
housing, schools, health clinics and 
infrastructure must be built at least at the 
same rate. Many communities are unable 
to keep up — as is evident from high 
unemployment rates, explosive growth of 
slum populations, overcrowded schools 
and health facilities and dilapidated public 
infrastructure (such as roads, sewage 
systems and power grids). Furthermore, in 
rapidly growing regions, about half’ of the 


contraceptive services. Even in rural areas 
of poor countries, women should have the 
choice of multiple contraceptive methods 
— including not only pills, injectables and 
barrier methods, but also long-acting meth- 
ods such as intrauterine devices and systems 
(IUDs and IUSs), implants and sterilization. 
Where legal, safe abortion services should 
be made available. Other obstacles to con- 
traceptive use, such as incorrect rumours 
about side effects and conservative social 
attitudes, should be addressed by the educa- 
tion of women and men, media campaigns 
and collaboration with community leaders. 
These efforts can be led by governments 
but better results are obtained when services 
are distributed through multiple channels, 
including private commercial provid- 
ers and non-governmental organizations. 
Importantly, coercion of any kind should be 
ruled out. Women and men have the right 
to decide freely on the number, timing and 
spacing of children, and on the means to 
achieve their reproductive goals. 
Achieving these aims requires substan- 
tially more resources than are available 
now. Funds are needed to build and equip 
clinics, to train and pay providers and to 
subsidize the direct cost of methods and 
services that are out of the reach of too many 
poor people. Over the past decade, invest- 
ments in the developing world have risen, 
especially after the 2012 London Summit 


population is aged under 20. The low ratio 
of workers to dependents depresses living 
standards and makes it more difficult to 
invest in the physical and human capital 
needed. The size of the formal labour force 
is also limited when women remain at 
home to care for large families. 


Maternal mortality. High birth rates mean 
frequent childbearing. Each pregnancy is 
associated with a risk of death or disability, 
and this is highest in countries with weak 
health-care systems. For example, in the 
poorest countries of West Africa, a woman’s 
risk of dying in childbirth before the end of 
her reproductive years is about 1 in 20. 


Political unrest. Youth unemployment 
becomes widespread when economies 

are unable to provide jobs. Vigorous 
competition for few jobs leads to low wages, 
which in turn contribute to poverty. Large 
numbers of unemployed and frustrated 
young men, in particular, fuel socio- 
economic tensions, high crime rates and 
political instability. 


on Family Planning, at which many donors 
and governments renewed or increased their 
commitments. 

Still, only 1% of all overseas development 
assistance (ODA) is now allocated to fam- 
ily planning’®. This amount is inadequate; 
in too many countries, programmes remain 
weak and politi- 


calcommitmentis “Importantly, 
lacking. coercion of any 
The proportion kind shouldbe 


of ODA allocated ruledout.” 

to family planning 

should be raised to 2% and developing- 
countries governments should expand their 
funding by an equivalent proportion. Sucha 
doubling of funding will be more than repaid 
by savings in other sectors such as education 
and health care in future years. 


FROM THE TOP 

The final, crucial ingredient for success is 
political will and a commitment to family 
planning at the highest levels of national and 
international policymaking. A fundamental 
reason for the low priority assigned to the 
issue is that it is considered a health and 
human-rights problem. 

Hence, family planning is part of the 
health budgets of donors and the respon- 
sibility for family-planning programmes is 
assigned to ministers of health throughout 
the developing world. Most poor countries 
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are battling a range of diseases, and family 
planning is often not seen as a high health 
priority. From a broader development per- 
spective, the low priority is nonsensical. 

Several actions would remedy this situ- 
ation. At the international level, develop- 
ment agencies and donors should hire more 
population experts (the World Bank, for 
example, employs thousands of economists 
but only a few demographers). These organi- 
zations write hundreds of reports on every 
dimension of development, yet only a frac- 
tion comment on population trends. Such 
reports should include a discussion of the 
role of demographic shifts in relevant sec- 
tors, of the development benefits of reduced 
birth rates and of the options available to 
change these trends. 

At the national level, similar changes are 
needed. Typically, population trends are 
noted in government plans but are con- 
sidered immutable and therefore of little 
interest. Ministries of finance or planning 
commissions often make detailed projec- 
tions for specific sectors but rarely examine 
alternative population trajectories. They 
should. Economists at the Copenhagen 
Consensus Center, a think tank, last year 
ranked family planning as one of the most 
cost-effective development interventions 
(see go.nature.com/fumoop). 

Family planning must be reclassified as a 
development intervention (as well as being 
a health and human-rights intervention) to 
give it the high national and global prior- 
ity it deserves. This would lead to a more 
cost-effective use of scarce development 
resources, and to more rapid growth in living 
standards in poor countries. m 
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Game human nature 


Finding ways to adapt natural tendencies and nudge collective action is central to the 
well-being of future generations, say Helga Fehr-Duda and Ernst Fehr. 


undreds of studies in behavioural 
Henne and other social 

sciences, conducted over the past 
few decades, have established that people 
often make choices that take into account 
the well-being of others. This contradicts an 
assumption held over centuries that people 
exclusively pursue their own material well- 
being. It also offers hope for the prospect 
of developing public goods (see “Glossary: 
behavioural economics’) that will benefit 
future generations. 

However, another body of evidence, 
gathered over the past two or three decades, 
indicates that people display an array of 
other tendencies, such as giving excessive 
weight to current benefits over future ones. 
These could hamper policies and initiatives 
aimed at building or sustaining public goods. 

The message emerging is that sustainable 


development will require the design of 


policies and schemes that specifically take 
advantage of some of our natural tendencies, 
and mitigate others. 


SIX TENDENCIES 

What does it mean to care about future 
generations? We posit that, by and large, 
future generations should be able to live 
as comfortably as do current generations 
in the developed world. This goal requires 
economic growth, at least in developing 
countries. It also requires current genera- 
tions to hand down high-quality ‘capital 
stock’ — natural resources, scientific knowl- 
edge, infrastructure and sophisticated eco- 
nomic, legal and educational institutions. All 
of these types of stock are effectively public 
goods: everyone can enjoy them without 
necessarily having paid directly for them. 


When it comes to the likelihood that 
investments in public goods will materialize, 
six behavioural tendencies come into play. 


Other-regarding preferences. Public- goods 
experiments generally involve giving indi- 
viduals in a group small sums of money that 
they can either keep for themselves or spend 
for the benefit of the group. Numerous such 
experiments, as well as field studies in real- 
world settings, have shown that many peo- 
ple have ‘other-regarding preferences’: their 
choices indicate that they care about things 
besides themselves, including the well-being 
of other people and the environment. 
People with other-regarding preferences 
are often willing to share the burden of pro- 
viding public goods as long as others do the 
same: they are ‘conditional cooperators’’. 
Such people often care about social norms 
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and are willing to punish those who violate 
them’. They also tend to be concerned 
about what others might think of them. For 
instance, often the threat of being named 
publicly (within the group) as a defector is 
enough to keep such people cooperating in 
experiments. 

There is considerable potential to lever- 
age these tendencies for the benefit of future 
generations. The effect of social norms and 
conditional cooperation has been dem- 
onstrated by one of the largest-scale field 
experiments ever conducted. Since 2008, a 
US company called Opower has sent letters 
to customers of energy utilities, showing 
them how their electricity use compares with 
that of their neighbours. (Currently, 15 mil- 
lion households in 9 countries receive such 
letters.) This intervention generated a 2% 
reduction in energy consumption. A study 
of the company’s three longest-running pro- 
grammes shows persisting effects after mail- 
ings stop, but the energy savings decrease by 
10-20% per year’. 

The emergence of smoking bans similarly 
provides a powerful example of what can be 
achieved through norm-altering collective 
action. Fifty years ago, smoking in public 
places was ubiquitous in Western countries. 
When it became clear that smoking causes 


GLOSSARY: BEHAVIOURAL ECONOMICS 

@ A public good is a resource such as the 
atmosphere that individuals cannot be 
effectively excluded from using, and for 
which use by one person doesn’t reduce 
the availability of the good to others. 


@ People who care about the well- 
being of others have other-regarding 
preferences. 


@ People who care only about their own 
material well-being have self-regarding 
preferences. 


© Conditional cooperators will share 

the burden of providing public goods or 
comply with social norms that benefit the 
group as long as others do the same. 


@ People whose tolerance of risk 
depends on when in the future the 
consequences of their decisions will 
materialize show delay-dependent risk 
tolerance. 


@ People who become more cautious 

as a result of observing the outcomes of 
their decisions over time show feedback- 
dependent risk aversion. 


@ Short-term impatience is giving 
disproportionate weight to current 
benefits compared to future ones. 


serious cardiovascular and respiratory 
diseases in non-smokers as well as smokers, 
a new social norm emerged. About 20 years 
ago, many countries began to prohibit 
smoking in indoor public spaces, and even 
in outdoor ones. 


Self-regarding preferences. Studies have 
also shown that there will always be some 
people who free-ride on the contributions 
of others. In public-goods experiments, for 
instance, some participants will stop con- 
tributing to the public good, but continue 
to reap the rewards from everyone else’s 
contributions. Ultimately, the cooperation 
seen in these experiments tends to break 
down unless defectors can be punished 
by being made to pay money back to the 
experimenter’. 

Such detrimental effects of free-riding on 
the common good are demonstrated by the 
frequently frustrated attempts of individual 
political activists to overcome dictatorships. 
In these cases, free-riders might refrain from 
joining the opposition because (if the activ- 
ists are successful), they stand to benefit 
without bearing the costs of fighting for 
political freedom. 

Thus, although other-regarding pref- 
erences are a prerequisite for achieving 
sustainable development, uncoordinated 
changes to individuals’ behaviours will not 
be sufficient. Political activists, for instance, 
will succeed only if they are able to turn their 
protests into a coordinated mass movement 
— as occurred in Tunisia in 2010 when civil 
resistance eventually led to the ousting of the 
country’s president. 


Delay-dependent risk tolerance. Other 
experiments have revealed that when eval- 
uating the costs and benefits of a decision, 
people tend to be more tolerant of risk if 
their decision affects the future rather than 
the present* (see ‘Time warp’). For instance, 
if people are given the choice of receiv- 
ing $100, or either $0 or $200 (with a 50% 
chance of receiving either), many will select 
the $100 if they are told that they will receive 
the money immediately, but the more risky 
option if they will receive the money ina 
year’s time. 

Delay-dependent risk tolerance can 
account for why, globally, insurance cov- 
erage for natural disasters — perceived to 
occur in the distant future — is low rela- 
tive to what is needed to prevent serious 
damage to a country or region’s future eco- 
nomic growth. Between 1960 and 2011, 
nearly 60% of ‘major natural catastrophes, 
those causing at least 100 fatalities and/or 
US$250 million in direct losses (in 2011 
dollars), were uninsured®. Even in high- 
income countries, only 50% of the damage 
resulting from catastrophes, such as earth- 
quakes, tsunamis, floods and so on, were 
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covered by insurance contracts*. 

These findings contrast sharply with the 
popularity of short-term accidental-death 
insurance policies sold at airports, which 
cover flight-related accidents, with the 
policy typically ending on completion of 
the flight. In the late 1950s and early 1960s, 
when such policies were still available at US 
airports, one group of underwriters insured 

potential losses of 


“Many people roughly $651 billion 
tend to be (in 2015 dollars) from 
excessively selling such policies, 
impatient with a profit margin 
inthe short of almost 60%. 

term.” This high tolerance 


of future risks and 
adversity to short-term risk suggests that 
investment in public goods that are relevant 
to climate change — such as tax increases 
to finance a shift to green-energy sources 
— will meet strong opposition from voters 
because the impact of such efforts is likely to 
be invisible for a long time, and the precise 
benefits are uncertain. 


Feedback-dependent risk aversion. Experi- 
ments also indicate that people become 
more averse to risk if they expect to observe 
the outcomes of their decisions over the 
course of time. For instance, in one classic 
study’, participants were asked to allocate 
100 shares to two assets, one more risky 
than the other. Different groups of partici- 
pants — informed about the performance 
of their shares either often or infrequently 
— could reallocate their shares on the basis 
of the feedback they had received. 

Participants who received informa- 
tion more frequently invested nearly 60% 
of their shares in the less risky asset in the 
final round of reallocation, whereas those 
receiving information less often invested 
only about 30% of their shares in the lower- 
risk asset. In this case, the people following 
asset prices more closely are more likely to 
observe losses, and so tend to shy away from 
the riskier investments. 

Such findings indicate that information 
that makes the costs of inaction visible — for 
instance, that global warming increases the 
occurrence of extreme weather events and 
the likelihood of wars — could significantly 
influence people’s behaviour. 


Short-term impatience. Numerous studies 
in behavioural economics and psychology 
have demonstrated that many people tend 
to be excessively impatient in the short term 
and give disproportionate weight to cur- 
rent benefits compared with future ones. 
For instance, if people are given the choice 
of receiving $10 today but $11 tomorrow, 
or $10 in 100 days versus $11 in 101 days, 
many will choose the smaller reward for the 
more immediate payment, but wait an extra 


SOURCE: REF. 4 


day for the larger reward in the second 
scenario. People also tend to be more 
averse to losses in the short term: current 
costs loom large, whereas future poten- 
tial benefits have little bearing on today’s 
decisions. Such behavioural tendencies 
explain why nearly one-third of people 
working in the United States have no 
retirement savings or pension, as found 
by a 2013 survey by the Federal Reserve's 
governors (see go.nature.com/kozp8q). 

One way to deal with this bias is to 
shift benefits and costs across time. In the 
case of pensions, ‘save more tomorrow” 
or ‘automatic escalation’ plans commit 
employees to save a percentage of their 
incomes in the future. The pioneers of 
these schemes, economists Richard Thaler 
and Shlomo Benartzi, have reported that 
about 4.1 million people were participat- 
ing in such plans in 2011, saving an extra 
$7.6 billion per year by 2014. 

The basic principle of committing to 
a policy today but delaying the conse- 
quences is routinely used by politicians, 
for instance to increase retirement age 
without losing voters. This idea could be 
transferred to other arenas. For instance, 
policymakers and governments could com- 
mit some percentage of future increases in 
gross domestic product — and the associ- 
ated tax hikes — to investments that benefit 
future generations. Although dependent on 
future economic growth, this would poten- 
tially increase the political acceptability of 
such investments by reducing the immediate 
burdens on the current generation. 

The feasibility of this idea in relation to 
climate change has already been demon- 
strated. In 2008, people in Zurich, Switzer- 
land, voted overwhelmingly to restrict their 
energy use to the levels of the 1960s by 2050 
(a concept known as the 2,000-watt society). 
Today, Zurich residents use less energy than 
the average person in Switzerland, who in 
turn uses only about half of that used by 
most US residents. 


Failure to carry through on intentions. A 
final insight from behavioural experiments 
and psychology studies is that people often 
fail to act on their intentions, be it because of 
inertia, inattention or the complexity of the 
task at hand. Take, for example, investments 
in energy-efficient appliances. Often, such 
investments provide net benefits to consum- 
ers because reduced operating costs more 
than offset higher purchase prices. How- 
ever, worldwide, there is still a considerable 
gap between people’ actual energy use and 
what it could be if they relied on the most 


FUTURE GENERATIONS 


A Nature special issue 
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TIME WARP 


When people are told that they could get $0 (worst 
possible outcome) or $200 (best possible outcome) with 
varying probabilities now (yellow) or in a year’s time 
(blue), they overestimate the likelihood of the best future 
outcome and underestimate the probability of the worst. 


== t=12 m (outcome in 12 months) 
~~ t=0 (outcome immediately) 


Perceived probability of receiving $200 


Perceived probabilities 
of the best possible 

m outcome become more 
optimistic when a bet 
pays out in the future. 


0) 0.2 04 0.6 08 1.0 
Objective probability of receiving $200 


efficient technology available to them — 
probably because people undervalue future 
cost savings. The consulting firm McKinsey 
estimates that, in the United States, $1.2 tril- 
lion could be saved by 2020 through use of 
more efficient technologies. 

One way to address this ‘resistance to 
switch would be to convey information in 
a form that motivates people to change their 
behaviours. In most developed nations, 
energy labels showing information on run- 
ning costs and efficiency are now provided 
on household appliances. Several small- 
scale field experiments suggest that labelling 
products with such information increases 
the likelihood that people will purchase a 
more efficient product. However, effects are 
small and may depend on the format of the 
information disclosed. 

People’s inertia could prove advantageous 
if, in a set of choices, the option favouring 
the public good requires no action. In many 
countries, the uptake of green-energy con- 
tracts has been 1% or less, even though in 
opinion polls, typically 50-90% of respond- 
ents say that they would use green energy if 
presented with a choice. A different picture 
has emerged in certain areas of Germany 
supplied by the utility company Energie- 
dienst. In the Black Forest town of Schénau 
and in several communities in the south, 
green electricity is offered as the default 
option. And this is what customers use 
unless they take active steps to find a new 
supplier or choose a different contract. A 
2008 study revealed that, in these commu- 
nities, more than 90% of people were using 
energy from renewable sources such as solar 


or wind’, even though in the short-term 
this is more costly than using electricity 
from coal, oil or gas. 

In other words, people’s unwillingness 
or inability to find and evaluate different 
options can be harnessed for the public 
good, while still protecting their freedom 
of choice. 


NURTURE OUR NATURE 

The picture from behavioural and social 
science is clear. Nudges — such as com- 
mitments to save or invest future income 
increases — will be crucial to changing 
behaviour, and should complement con- 
ventional policies. 

In the case of climate change, there 
are three principal take-home messages. 
First, conventional policy tools — such 
as taxing greenhouse-gas emissions — 
will be essential. Second, because people 
cooperate over the long term only if most 
others do as well, all key nations must be 
on board in international agreements’. 
Third, institutions for measuring and 
enforcing compliance — such as meas- 
ures to monitor emissions and to verify 
national reports — will be needed’. 
Self-interest is a powerful force in human 

behaviour. But it is also part of our nature to 
care for others, including people who have 
not yet been born. Many examples in human 
history — from the abolition of slavery and 
the removal of dictatorships to the histori- 
cal decline in violence and enforcement of 
human rights — demonstrate that it is pos- 
sible to mobilize the better angels of our 
nature’ to improve the human condition. m 
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A hyena hunting flamingoes in Kenya. 


EVOLUTION 


Doing the locomotion 


Kevin Padian assesses a study on how animals came to fly, wade, creep and glide. 


ionable guide to the marvels of locomo- 
tory evolution than Matt Wilkinson. In 
Restless Creatures, the zoologist and writer 
rehearses twice-told tales of animals becom- 
ing bilateral, exiting the sea for the land and 
evolving flight, but makes them fresh. These 
are wonderfully adept and informed expla- 
nations of locomotory modes — whether in 
birds, gliding snakes, eels, sharks or a host of 
fossil vertebrates — and there is not a single 
vignette that I failed to learn something from. 
His prose is knowing and wonky enough to 
sound as if it comes from the reincarnation 
of that senior lecturer you had who knew too 
much for most of the undergraduates, but 
was hip and amusing out of class. 
Wilkinson’s thesis is that locomotion 
drives the evolution of all other features. This 
is laid out in his introductory chapter, which 
begins by insulting most people with reli- 
gious sensibilities: “Righteousness apparently 
requires that we take everything for granted” 
He then immediately takes natural selection 
for granted, declaring that “living things are 
as they are largely as a result of the process 
of adaptation” But he provides no demon- 
stration of this, making it seem as if the only 


lE would be hard to find a more compan- 


difference between the 
world views of science 
and religion is taste, 
because the believers 
of neither feelaneed | CRE SS 


: RES 
to test it. 

Wilkinson does | | | 
modify the claim “Us. |) 
that natural selection ; 4 
answers all questions Restless 


Creatures: The 
Story of Life in Ten 


about life by writing 
that it is only “sort 


of true”, because it Movements 
MATT WILKINSON 
neglects the effects Rens 


of mutations and the 

environment. But this is a 1960s view of evo- 
lution, of the kind propounded by Ernst Mayr. 
Yes, natural selection, mutation and environ- 
ment are important. But there are also the 
inherited constraints of development, mate- 
rial properties and other functions of organ- 
isms, such as metabolism and reproduction, 
which are not necessarily optimized by natu- 
ral selection. Sometimes they even work 
against it. The peacock’s tail, for example, has 
long been thought (perhaps erroneously) 
to be a disadvantage in escaping predators 
quickly. The idea that the need for some form 
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of locomotion has driven the whole evolution 
of plants and animals is a risky conceit. 

But a deeper problem pervades Restless 
Creatures. About 40 years ago, evolutionary 
biologists such as Niles Eldredge and Joel 
Cracraft began to set hypotheses about the 
evolution of major features in phylogenetic 
context. Defining and dissecting an adapta- 
tion’s components and then mapping them on 
an independently derived tree of relationships 
help us to visualize the sequence in which 
these traits arose, and therefore to test ideas 
of how functions evolved. We know that to 
fly, an animal does not need hollow bones or 
aperching foot. But it does need a wing capa- 
ble of generating a specific kind of stroke, and 
the energetic and neuromuscular capacity to 
sustain flight. If you can work out in what 
sequence these traits appeared on a phylo- 
genetic tree of the first members of a group, 
then your ideas about how flight evolved can 
become more explicit and testable. 

But none of this is 


in Wilkinson's book. OD NATURE.COM 
The older and ulti- Formoreonscience 
mately unsatisfactory _ in culture see: 
approach that he uses __niafure.com/ 
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assuming) the functional efficiency of a struc- 
ture in a single animal, and then imagining 
how natural selection would have shaped it. 
This is a good approach up to a point, but how 
do you know if you are wrong? One way is to 
check whether your hypothesis matches the 
phylogenetic pattern. But there is not a single 
phylogeny in this book to provide an inde- 
pendent test, even though that would provide 
a pictorial ‘flow chart to clarify the proposed 
direction of evolutionary change. 

In the chapter on flight, on which he has 
published original research, Wilkinson begins 
by discussing some of the physical parameters 
(such as lift and drag) that constrain flight. 
This is straightforward and well written, 
although he does not sufficiently distinguish 
the kinematic and aerodynamic differences 
between gliding, flapping and other modes 
of flight. He assumes that all flying animals 
had gliding ancestors. But gliders such as the 
colugo and flyers such as bats are on com- 
pletely different branches of the vertebrate 
tree — and their ecologies are completely 
different. Moreover, no one has convincingly 
shown how to transform a gliding wing into 
a flapping one, or that this ever happened. It 
cannot be assumed that stiff-limbed, quad- 
rupedal gliders evolved the specialized flight 
stroke. This is why phylogeny matters. 

Wilkinson acknowledges that opinion 
differs on the evolution of feathers, but he 
ignores the literature that has placed early 
feather types and their inferred functions 
— insulation, colour, brooding, inclined 
running and flight — in phylogenetic order 
(see R. O. Prum and A. H. Brush Sci. Am. 
288, 84-93; 2003). He does not explain the 
anatomy and functional morphology of flying 
animals, and pays little attention to kinematic 
studies of living animals that describe the uses 
of forelimbs and feathers (see, for example, 
K. P. Dial et al. Nature 451, 985-990; 2008). 

Restless Creatures runs the gamut from 
microorganisms to climbing primates 
(anthropoids, not clerics), although these are 
presented in no particular order. Wilkinson 
shows how locomotion is intimately related 
to breathing, metabolic rates, habitat and 
developmental genetics in a variety of 
groups. I was pleased that he also discusses 
the locomotion of pollen and seeds, because 
animal biologists so often ignore plants (see 
I. T. Baldwin Nature 522, 282-283; 2015), 
even though they provide an alternate uni- 
verse of evolutionary possibilities. There is a 
great deal in this book that is enjoyable and 
informative, but ultimately, I found it a less 
than satisfying guide to the evolutionary 
biology of locomotory studies. m 
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Books in brief 


Pandemic: Tracking Contagions, from Cholera to Ebola and 
Beyond 

Sonia Shah SARAH CRICHTON (2016) 

Cholera — the acute bacterial infection that can kill in hours — 
serves as a lens on new pandemics in this grounded, bracingly 
intelligent study. As science journalist Sonia Shah reveals, more than 
300 infectious diseases have emerged or re-emerged in the past half- 
century, and epidemiologists predict a catastrophic pandemic in the 
next. Shah lucidly layers history into a tour of transmission hotspots, 
from incubators of ‘spillover’ animal-borne illnesses such as China’s 
wild-animal markets to globalized transport and hyperdense cities. 


The Most Wanted Man in China: My Journey from Scientist to 
Enemy of the State 

Fang Lizhi, translated by Perry Link HENRY HOLT (2016) 

This memoir by late Chinese astrophysicist and dissident Fang 
Lizhi is a trenchant explication of science under siege. Fang learned 
English partly by studying Paul Dirac’s 1930 Principles of Quantum 
Mechanics, and carved out a career at the University of Science and 
Technology of China. Although his youthful love for Communism 
withered during the forced labour and expulsions of the Cultural 
Revolution and beyond, Fang’s “awe at the colossal thing called the 
universe” never waned, surviving surveillance and exile. Inspiring. 


NeuroLogic 

Eliezer J. Sternberg PANTHEON (2016) 

Aman stumbling around in a brightly lit room insists it is dark. When 
ascan shows damage to his brain’s visual monitoring as well as its 
processing system, his internal ‘logic’ is revealed. This case study 
is just one of many marshalled by neurologist Eliezer Sternberg 
for his research-rich study of the neurological circuitry behind 

the narratives we use to make sense of things. Sternberg cracks 
open the brain’s “black box” to examine its parallel conscious and 
unconscious systems, and explores states from dreaming and acts 
on ‘autopilot’ to memory, hallucinations and trauma. 


Privacy: A Short History 

David Vincent POLity (2016) 

Have the reports of privacy’s demise been greatly exaggerated, or 
is it the dodo of our digitized world? Social historian David Vincent 
examines that question in this deft study of privacy in houses, 
cities, correspondence and surveillance, from 1300 to today. We 
peer into the “fugitive spaces” where medievals scratched epistles; 
the Victorian dichotomy of fortress-like abode and sociable public 
transport; and today’s paranoia-soaked debates over digital media. 
Whether linked to ideas of sanctuary, secrecy or intimacy, privacy is 
a flashpoint in the charged relationship of individual to society. 


Snowball in a Blizzard: A Physician’s Notes on Uncertainty in 
Medicine 

Steven C. Hatch Basic (2016) 

How do you pinpoint a tumour ina mammogram? About as easily 
as you find a snowball in a blizzard, writes medical academic 

Steven Hatch in this penetrating examination of uncertainty in 
diagnoses and treatment. It is both constant, he shows, and ignored 
by physicians at their, and their patients’, peril. He also shows why, 
looking in turn at issues such as false positives, mammography, 
hypertension treatments, drug trials and media reportage. Barbara Kiser 
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| COMMENT | BOOKS & ARTS 


HISTORY OF SCIENCE 


When eugenics became law 


Victoria Nourse reviews a study on a historic US misuse of biology, the case of Buck v. Bell. 


ugenics is a well-known low point in 
BR: modern history of science. In the 

United States, from the late nineteenth 
century to the 1940s, credence was given to 
this pseudoscience focused on the notional 
‘improvement of human populations by 
halting the reproduction of supposedly lesser 
genes. Less well known is the story of how US 
law rendered eugenics intellectually respecta- 
ble across the world, supporting programmes 
from Canada to Sweden. Ultimately, this 
egregious failing led to the enforced steri- 
lization of at least 60,000 US citizens, and 
was used by the Nazi regime to justify its 
own programme of sterilization and, later, 
extermination. 

Adam Cohen's Imbeciles relates a key 
chapter in this story, the 1927 US Supreme 
Court case known as Buck v. Bell. The case 
began in September 1924, when Albert 
Priddy, head of the Virginia State Colony 
for Epileptics and Feebleminded, an asylum 
near Lynchburg, asserted that Carrie Buck, a 
teenage mother who had entered the asylum 
that June, was an “imbecile” — a term used 
at the time to signify intellectual disability. 
Priddy petitioned the asylum’s board of 
directors to sterilize Buck. 

Priddy’s petition relied on a Virginia sterili- 
zation law that had become effective months 
earlier, and that Priddy himself had advocated 
for. At the time, almost half of US states had 
eugenics laws, but sterilization was consid- 
ered dubious constitutionally. When Priddy 
died, his successor, John Hendren Bell, lent 
his name to the case. In 1927, the statute’s 
constitutionality was affirmed by the Supreme 
Court, effectively reviving laws that even 
eugenicists had believed were unenforceable. 
Justice Oliver Wendell Holmes, in writ- 
ing the decision, penned the now-famous 
phrase “three generations of imbeciles are 
enough’, referring to Buck, her daughter and 
her mother. All three had been erroneously 
‘diagnosed’ with mental disabilities; Buck’s 
mother was also in the asylum. 

Imbeciles is lively, accessible and, inevita- 
bly, often heart-wrenching. To historians of 
eugenics, much of the background and some 
of the characters will be well known. Eugenics 
champion Harry Laughlin (author of a tome 
on US eugenics law) and his fellow researcher 
Arthur Estabrook, who testified at Buck’s trial, 
are two of them. Others include Priddy and 
Aubrey Strode, the lawyer and legislator who 
wrote the Virginia sterilization law. 

Cohen is at his best when writing about 
Buck. Priddy claimed that she was a sexual 


Carrie Buck (left) and her mother Emma in 1924. 


‘degenerate’ and ‘feebleminded” In fact, she 
was in the asylum because she had been raped 
and made pregnant by a relative of her foster 
parents. Thanks to research by evolutionary 
biologist Stephen Jay Gould and historian 
Paul Lombardo, author of Three Generations, 
No Imbeciles (Johns Hopkins University Press, 
2008), we know that she had no disability. She 
was a poor, isolated victim of a social system 
that branded unmarried mothers as danger- 
ous, and a punitive legal system. 

Cohen's account of the Supreme Court 
trial is illuminating. The law had been 
written to require fair legal procedures for 
the person sterilized, but Buck’s hearing was 
entirely one-sided. Ludicrous testimony was 
admitted about Buck's child, then less than 
a year old. Cohen shows how Bucks lawyer, 
Irving Whitehead, was a fraud and imposter 
by today’s standards, failing to ask questions 
that would help his client or to call witnesses 
on her behalf. 
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procreating. As the fallacious thinking went, a 
sexually active, disabled individual would (in 
Holmes’s words) sap the “strength of the state” 
by creating a lineage of disabled descendants. 

Before Buck v. Bell, eugenic sterilization 
had been advocated for decades by US 
reformers and scientists, including promi- 
nent biologist Charles Davenport, but it 
had been used only sporadically because of 
fears that it was illegal. Eugenics itself was 
born in Britain in the late nineteenth cen- 
tury, nurtured by polymath Francis Galton, 
a half-cousin of Charles Darwin. The con- 
cept resonated with contemporary interpre- 
tations of ‘social Darwinism, which hinged 
on engineering the ‘survival of the fittest’ — a 
gross caricature of Darwin's idea. 

By 1928, a total of 375 US universities and 
colleges were teaching eugenics, and 70% of 
high-school biology textbooks endorsed the 
pseudoscience in some form. Eugenics was 
also endorsed by presidents including Theo- 
dore Roosevelt, funded by philanthropic 
organizations including the Carnegie Institu- 
tion, and touted by award-winning scientists 
such as biologist Edwin Grant Conklin and 
the Nobel laureate Hermann Muller, discov- 
erer of X-ray mutagenesis, as well as promi- 
nent inventors such as Alexander Graham 
Bell. Eugenics came to be seen as the solution 
to everything from hearing loss to criminal- 
ity. In Britain, advocates tended to focus on 
segregation and voluntary sterilization. Major 
British eugenicists included left-leaning sci- 
entists J. B. S. Haldane and Havelock Ellis, 
and supporters included the economist John 
Maynard Keynes, social reformers Sidney and 
Beatrice Webb, and writer H. G. Wells. 

In 1927, a month after her sterilization, 
Buck was released from the asylum as hired 
help, on a kind of parole. Later, she married. 
Her sister Doris was also sterilized; inter- 
views suggest that she thought the operation 
an appendectomy. Imbeciles traces their later 
life in detail, noting one of the most poign- 
ant aspects of the case — Buck's letters to the 
asylum about her relatives and probationary 
status. These, Cohen notes, revealed Buck to 
be intelligent and diligent in trying to contact 
and protect her mother and child, who lived 
with a foster family: a testament to one of the 
most spectacular miscarriages of justice in 
US history. To this day, the Supreme Court 
has never officially overruled Buck v. Bell. = 
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Georgetown University, Washington DC. 
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Starlings’ patterns 
are not spontaneous 


You incorrectly use the flocking 
of starlings in murmurations to 
illustrate the concept of ‘active 
matter’ in determining the 
physics of life (Nature 529, 16-18; 
2016). This behaviour depends 
on information conveyed from 
individuals beyond a bird’s 
immediate neighbours — unlike 
the spontaneous patterns 
formed in active matter from 
independently moving units. 

In addition to information 
about the flight of their near 
neighbours, starlings use their 
location in the flock to adjust 
their murmuration behaviour 
(D.J.G. Pearce et al. Proc. Natl 
Acad. Sci. USA 111, 10422- 
10426; 2014). The flock provides 
this information by projecting 
a ‘shadow’ of its surrounding 
density on each bird’s retina. This 
lets the starling know whether it 
is on the edge of the flock or not, 
and safe or not from predators. 

The starlings’ behaviour 
is more akin to the idea of 
reflexivity (see G. Soros J. Econ. 
Methodol. 20, 309-329; 2013). 
In reflexivity, individuals act in 
response both to the behaviour 
of those immediately around 
them and to the sociological 
effect of interacting as a collective 
entity — in this case, the flock. 
Like humans, a group of starlings 
seems to interact by using non- 
local sociological information to 
shape individual behaviour. 
John Skoyles University College 
London, UK. 
j.skoyles@ucl.ac.uk 


Ancient forest: spare 
it from clearance 


The Bialowieza Forest on the 
borders of Poland and Belarus is 
the last lowland primeval forest 
in Europe that is still governed 
by natural processes. It is now 
under threat from logging on an 
unprecedented scale, proposed 
by the new Polish government 
ostensibly to halt an outbreak 
of European spruce bark 


beetle (Ips typographus). We 
argue strongly for Bialowieza's 
preservation: it provides a 
much-needed ecological 
blueprint for restoring ancient 
forests, and a unique laboratory 
for investigating the effects of 
global change. 

The government's proposed. 
measures are to be included as 
an update to Poland's 10-year 
forest management plan, which 
in 2012 set timber extraction at 
48,000 cubic metres annually. 
This limit was enough to meet the 
needs of local communities and to 
maintain spontaneous ecological 
processes. 

The planned large-scale felling 
and salvage logging of infested 
trees ignores the bark beetle's 
keystone role in shaping the long- 
term dynamics and structure 
of forests (see B. Beudert et al. 
Conserv. Lett. 8, 272-281; 

2015). Moreover, to contain the 
outbreak, it would be necessary to 
log 80% of infested trees (L. Fahse 
and M. Heurich Ecol. Model. 222, 
1833-1846; 2011), which would 
mean violating the 35% of forest 
area that is protected. It is notable 
that only 57% of the stipulated 
harvest refers to spruce, the bark 
beetle’s host tree species. 
Przemystaw Chylarecki 
Museum and Institute of Zoology, 
Polish Academy of Sciences, 
Warsaw, Poland. 

Nuria Selva Institute of Nature 
Conservation, Polish Academy of 
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Ancient forest: keep 
the logging ban 


Poland’s new conservative 

government is planning to lift 

a moratorium on the logging 

of old-growth trees in the 

primeval Bialowieza Forest, 

amid mounting opposition from 

the Polish Academy of Sciences, 

ecologists and activists. 
Divided between Belarus 

and Poland, the 1,500-square- 

kilometre forest is the largest 

remaining part of avast, 

8,000-year-old temperate forest 


that blanketed the European 
lowland (see Nature 455, 
277-280; 2008), and is a World 
Heritage Site. 

The forest supports the 
largest population of European 
bison (Bison bonasus), as well 
as wolves, lynxes and some 
120 species of breeding bird, 
including the lesser-spotted 
eagle (Clanga pomarina) and the 
three-toed woodpecker (Picoides 
tridactylus). 

Poland’s new environment 
minister, Jan Szyszko, backed by 
the logging lobby from Poland’s 
state forests (see go.nature.com/ 
dcjp2x; in Polish), claims that 
the forest is “rotting away”. He 
proposes a 33-fold increase in 
logging. 

Although the changes do 
not affect the small Bialowieza 
National Park directly, the 
forest’s remaining old growth 
will disappear from the buffer 
zones within a couple of years 
and jeopardize this unique 
ecosystem’s sustainability. 
Pawel Michalak Virginia 
Polytechnic Institute and State 
University, Blacksburg, USA. 
pawel@vbi.vt.edu 


Cochrane reviews 
expose bias too 


Cochrane is an international 
network of health professionals, 
patients and others who 
summarize data from research 
studies in systematic reviews 

to help users to make informed 
decisions about disease 
treatment, prevention and 
screening (go.nature.com/ 
z5w5ah). The model we use to 
accomplish this in the Cochrane 
Eyes and Vision (CEV) US group 
can also expose errors of the 
type noted by David Allison and 
colleagues (Nature 530, 27-29; 
2016). 

In this group, clinician authors 
partner with a methodology 
‘hub’ of experts who check data 
and apply validated methods 
in conducting the systematic 
review. These experts are trained 
to identify potential biases and 


statistical errors in reviews, 
and to expose what seem to be 
deliberate reporting biases or 
manipulation of outcomes. 

The experts can identify 
errors in primary studies 
and draw attention to them, 
but cannot correct existing 
errors. Content experts need to 
collaborate with methodologists 
to eliminate mistakes at the 
outset, by designing and doing 
studies at a low risk of bias. 
Journal editors should insist 
that authors adhere to reporting 
standards. And the public 
must demand that health-care 
decisions are underpinned by 
conclusive scientific evidence. 
Kay Dickersin, Tianjing Li 
Johns Hopkins Bloomberg School 
of Public Health, Baltimore, 
Maryland, USA. 
kdicker3@jhu.edu 
Competing financial interests 
declared (see go.nature.com/3 1fu47). 


That’s quite a 
collection 


Your caption under the photo 
of testy entomologist Harrison 
Dyar that accompanies the 
review of Marc Epstein’s 
biography (Nature 529, 152-153; 
2016) credits Dyar with having 
collected “500,000 different 
kinds of mosquitoes over his 
career”. Given that there are 
still only about 3,000 species 
of mosquito known worldwide 
today, this seems a bit ofa 
stretch. 

Dyar may have collected 
500,000 specimens, but 
not 500,000 different kinds. 
Rick Borchelt College Park, 
Maryland, USA. 
rborchelt@gmail.com 


CORRECTION 

The Correspondence ‘No 
myth: benefits of breast 
screening’ (Nature 529, 283; 
2016) omitted a declaration 
of competing interests. This 
has been added to the online 
version. 
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AUTOIMMUNITY 


Antigen-specific immunotherapy 


Nanoparticles coated with fragments of the body’s own proteins are shown to induce T cells of the immune system to adopt 
regulatory functions that suppress autoimmune reactions involving these self-antigens. SEE ARTICLE P.434 


DAVID WRAITH 


utoimmune diseases arise when our 
Aim system attacks our own 

tissues. The immune cells of affected 
individuals are insufficiently tolerant towards 
certain ‘self’ proteins and attack them as if 
they were foreign. Helper T cells (T,, cells) 
play a central part in autoimmune diseases 
because they orchestrate the function of other 
cells in the immune system, including B cells, 
cytotoxic T cells and macrophages. Current 
treatments for autoimmune diseases tend to 
suppress the whole immune system or, at best, 
inhibit the movement or function of T cells; 
such approaches inevitably increase the risk 
of infection and cancer. The ideal treatment 
for autoimmune diseases would convert the 
function of Tj, cells from disease-causing 
to disease-regulating without affecting the 
rest of the immune system. In this issue, 
Clemente-Casares et al.' (page 434) describe 
coated nanoparticles that mediate this con- 
version by binding to receptors on potentially 
self-reactive T cells. 

The authors’ approach can be considered 
as a type of antigen-specific immunotherapy. 
Antigens are the molecular structures that 
induce the activation of T or B cells; for T cells 
these are generally small fragments of proteins 
(peptides). Each T cell can express a different 
surface receptor, thereby allowing our immune 
system to respond to countless different anti- 
gens, including self-antigens. Antigen-specific 
immunotherapy is designed to dampen the 
immune response to a particular antigen or 
set of closely associated antigens. This con- 
cept has been used to treat allergies for more 
than a century’, but specific immunotherapy 
for autoimmune diseases lagged behind until 
the discovery that T,, cells are activated by pep- 
tides bound to MHC class II proteins. This led 
to the design of peptides that selectively target 
T,, cells without risking the activation of self- 
reactive cytotoxic T cells or B cells’. 

How can exposure to a peptide known to 
stimulate self-reactive T,, cells switch off the 
disease they cause? This is best explained by 
the ‘two-signal’ rule of T-cell activation*”. 
All antigens, whether self or foreign, must be 
broken down into peptides, which must then 
bind MHC class II proteins and be displayed at 
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Figure 1 | Coated nanoparticles induce differentiation of regulatory T cells. Clemente-Casares et al.’ 
produced nanoparticles coated with peptide fragments of the body’s own proteins that are associated 

with autoimmune disease, bound to MHC class II proteins. They show that treating mice with these 
nanoparticles modifies the function of T,,1 cells that have receptors specific for that particular peptide: 
instead of inducing an immune response against the self-protein, the T,,1 cells differentiate into regulatory 
(Tx1-like) T cells that secrete the anti-inflammatory protein IL-10. The IL-10 promotes the differentiation 
of B cells into IL-10-secreting regulatory B cells, and also modifies the ability of antigen-presenting cells 
(APCs) to present the specific peptide to immune cells. Furthermore, the T,.1-like cells can inhibit the 
activation of helper (CD4") and cytotoxic (CD8") T cells that are specific for other peptides presented by the 
same APC, and thus mediate bystander suppression. In this way, T,1-like cells can target APCs in tissues 
affected by autoimmune reactions and thereby suppress the inflammation associated with the disease. 


the surface of antigen-presenting cells (APCs) 
to activate T,, cells. This is referred to as 
signal 1. The APC must also upregulate co- 
stimulatory molecules, such as CD80 and 
CD86, to provide the second signal required 
for T,,-cell survival and proliferation. 

What happens when T), cells receive signal 1, 
but not signal 2? Historically, this was thought 
to induce a state of unresponsiveness known 
as anergy®. Now, Clemente-Casares et al. 
show that treating T,, cells with nanoparticles 
coated with a peptide bound to MHC class II 
proteins (pPpMHC-NP treatment) triggers 
signal 1 alone. But rather than simply inducing 
anergy, the treatment drives the T,, cells to 
differentiate into cells that have characteris- 
tics of regulatory T cells; these act to dampen 
immune responses. 
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The resulting regulatory cells exert their 
function by secreting the anti-inflammatory 
proteins IL-10 and TGF-8. Furthermore, the 
cells express the transcription factor T-bet and 
make the cytokine signalling molecule IFN-y 
during their differentiation. These character- 
istics imply that they derive from cells of the 
Ty] subset of T,, cells (Fig. 1). The differen- 
tiation of IL-10-secreting T cells — referred 
to here as T,1-like cells — from T,1 cells is 
an immunoregulatory mechanism known 
to prevent excessive immune responses to a 
range of infections’’. These cells mediate a 
negative feedback mechanism involving sup- 
pression of co-stimulatory molecules on APCs 
and a reduction in the inflammatory proteins 
secreted by APCs”. 

What are the downstream effects of the 


Tal-like cells induced by pMHC-NP treat- 
ment? Clemente-Casares et al. show that 
the cells suppress the function of APCs and 
reinforce immune regulation by promot- 
ing IL-10 production by B cells (Fig. 1). The 
authors verify the specificity of their approach 
by using different experimental models of 
autoimmune disease. PMHC-NPs carrying 
peptides from collagen, an antigen derived 
from joints, suppressed disease in a mouse 
model of rheumatoid arthritis, but not in mice 
with experimental autoimmune encephalitis 
(EAE), a model of multiple sclerosis. Con- 
versely, PMHC-NPs carrying peptides of 
antigens from the central nervous system con- 
trolled EAE but not collagen-induced arthri- 
tis. This confirms that the immune regulation 
induced by pMHC-NP treatment is specific to 
the antigen and tissue, and so to the disease. 

Furthermore, the pMHC-NPs did not need 
to target T cells specific for all peptides in the 
affected organ. Even peptides from sub-domi- 
nant antigens (weaker antigens that do not trig- 
ger disease in the first place) were able to induce 
Tl -like cells that suppressed helper and cyto- 
toxic T cells with activity against other antigens 
(Fig. 1). Thus, although this treatment is highly 
antigen-specific at the induction phase, it can 
influence other arms of the immune response 
locally, through induction of regulatory B-cell 
activity and suppression of helper and cyto- 
toxic T cells specific for different antigens. 
This requires that the peptide fragment from 
the inducing antigen and the other antigens are 
presented by the same APC. 

Is it possible that such bystander suppression 
could lead to systemic immune suppres- 
sion by switching off cells not involved in the 
autoimmune response, thereby increasing 


EPIGENETICS 


the risk of infection or cancer? No: bystander 
suppression will be limited to lymph nodes 
associated with the affected organ and will 
influence only those APCs presenting the 
relevant self-antigen. Such specificity is clearly 
demonstrated by Clemente-Casares and col- 
leagues — mice treated with pMHC-NPs are 
protected against the relevant autoimmune 
disease, yet show undiminished responses to 
infections and foreign antigens. 
The experimental treatments in this study 
use well-character- 
ized models of auto- 


“The treatment immune disease. 
drives the T,, cell But is this work just 
to differentiate another therapeu- 
into cells with tic approach that 
characteristics of | works in mice but 
regulatory T cells; will never work in 
these act to humans? It seems 
dampen immune not: the authors 
responses.” show that pMHC- 

NP treatment leads 


to differentiation 
and proliferation of human T,1-like cells in 
immunodeficient mice transplanted with 
human T and B cells, demonstrating that 
pMHC-NP treatment works on human cells. 
The team’s work also suggests that treatment 
with pMHC-NPs is more effective than with 
monomers of MHC-bound peptides at an 
equivalent dose. Furthermore, pMHC-NP 
treatment seems to be more suppressive than 
the application of peptide alone; however, the 
doses and routes of administration in these tests 
were not comparable. 
There is overwhelming evidence that 
peptide antigens can induce T,1-like cells" 
and suppress autoimmune diseases in both 


A new methyl mark 
on messengers 


The presence of an N' methyl group on adenine bases in DNA and RNA was 
thought to be a form of damage. Results now show that it also occurs at specific 
sites in messenger RNAs, where it affects protein expression. SEE ARTICLE P.441 


ANNA M. KIETRYS & ERIC T. KOOL 


he fact that chemical modifications to 
DNA bases can alter gene expression 
without changing the nucleic-acid 
sequence has been known for more than a 
decade. But the key regulatory function of 
many such epigenetic modifications to mes- 
senger RNA molecules has been recognized 
only recently'*. A simple mark — the methyl 
group — is widely observed in DNA and its 


associated histone proteins, and has been 
studied in mRNA in the forms of 5-methyl- 
cytosine and N°-methyladenosine. On 
page 441 of this issue, Dominissini et al.* 
present a new member of the mRNA methyl- 
marked family, N ‘-methyladenosine, and 
propose that its presence in mRNAs has an 
influence on biological processes. What is 
surprising about this modification is that it has 
previously been described as a form of cellular 
damage’. 
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mice and humans’. The fact that pMHC-NP 
treatment induces T,1-like cells similar to 
those seen after the administration of peptide 
alone suggests that pMHC-NPs mimic the 
APC to which therapeutic peptides bind 
in vivo. The challenge with each of these 
approaches will be to find the optimal dose and 
route of administration for treating people. As 
these options progress towards clinical trials, it 
is vital that their mechanism of action is inves- 
tigated in detail so that patients can benefit 
fully from antigen-specific immunotherapy for 
autoimmune disease. = 
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N'-methyladenosine (m'A) is unusual in 
having a positive charge at physiological pH 
(other bases are uncharged) and a methyl 
group that blocks the Watson—Crick base- 
pairing edge of adenine. The modification 
was previously documented in transfer RNA 
molecules, where it plays a crucial part in 
the formation of tertiary RNA structure. 
The methyl group forces the m'A base to 
pair with a non-Watson-Crick configura- 
tion’, and the positive charge has also been 
hypothesized to exert an electrostatic influ- 
ence on protein interactions’. Dominissini 
et al. propose that this base modification may 
have similar biophysical effects in mRNAs: it 
could affect base-pairing interactions close 
to the site at which protein translation starts, 
and might alter RNA folding and electrostatic 
interactions. 

m’A was previously known to be formed 
by the exposure of RNA molecules to alkylat- 
ing agents, and under alkaline conditions it 
is converted to N °-methyladenosine (m°A) 
through the Dimroth rearrangement’ (Fig. 1). 
This is potentially a big problem in analysis, 
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Figure 1 | Dynamic methylation of adenine. Methylation at the N’ position of the base adenine 
generates the N'-methyladenosine (m'A) modification in RNA molecules, and occurs as a result of 
damage by chemical methylating agents’. Dominissini et al.’ report the presence of m'A in undamaged 
messenger RNA molecules, and propose that it represents a cellular epigenetic mark. In response to 
damage, the repair enzyme ALKBH3 can remove this methyl group to repair the base, but it is not clear if 
this or other enzymes function to remove the mark in an epigenetic context, nor how this mark is added". 
Another form of methylated adenine, m°A, is also recognized as an epigenetic mark in mRNA, added by 
the enzyme METTL3/14 and removed by FTO or ALKBH5. Under alkaline conditions, m'A is converted 


to m°A through the Dimroth rearrangement. 


because m°A is a considerably more abundant 
modification in mRNA, and thus obscures the 
rarer mA. Dominissini et al. found methods to 
suppress this confounding chemical instability 
of m‘A. Through carefully controlled sample 
preparation, they reduced the incidence of 
the m'A-to-m°A rearrangement to less than 
10%, and the use of an isotopically labelled 
internal control during mass-spectrometry 
analysis allowed them to accurately quantify 
m'A. These techniques enabled the research- 
ers to identify the overall amounts of the 
modification in cells, and to map its locations 
inmRNAs. 

The authors find that, although the 
N' methylation of adenine occurs several 
times less often than N° methylation, it is still 
found in thousands of mRNAs. They note an 
enrichment of m‘A signals near the transla- 
tional start site of these mRNAs, and most of 
the molecules analysed were methylated only 
once, suggesting that the group has a single 
role ina given RNA. In the same region, they 
found sequences rich in the bases guanine and 
cytosine; such GC-rich motifs correlate with 
thermodynamically stable secondary RNA 
structure, which could be involved in directing 
the placement of m'A. Moreover, the authors’ 
mapping of m‘A acrossall RNA transcripts in 
the cell (the transcriptome) revealed that the 
modification in mRNAs is associated with 
higher levels of proteins. 

What role does m'A have in the cell? 


Dominissini et al. found that the fraction of 
mRNAs containing m'A correlates positively 
with the level of gene expression, and may 
affect changes in cellular metabolism. The data 
also suggest the presence of a greater number 
of alternative translation initiation sites in the 
methylated RNAs than in RNAs lacking this 
modification. Most notable was the finding 
that m'A-modified RNAs produced 1.7-fold 
higher protein levels than did non-methyl- 
ated ones. Deeper analysis of the translation 
process led the authors to suggest that m'A 
is involved in the processes by which pre- 
mRNA molecules are ‘spliced’ to form mature 
mRNAs. Thus, influencing protein expression 
seems to be one of the strongest effects of this 
methyl mark. 

Importantly, the m’A pattern in yeast RNAs 
differed from that in human and mouse 
RNAs: in yeast, the mark was distributed 
across the coding sequence without preferred 
locations, suggesting that more-sophisticated 
organisms have evolved m'A marking as a 
distinct pathway. Comparing mouse and 
human mRNAs, the authors observed that 
the positions of m'A modification showed 
33% conservation, and that N *_methylation in 
5’-untranslated regions and close to the start 
site exhibited even higher conservation. The 
researchers also demonstrated that the m'A 
level varies significantly between different 
mouse tissues. Furthermore, exposing cultured 
cells to different stress conditions resulted in 
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changes in the m‘A level, which suggests that 
m’A is a dynamic modification with a role 
in cellular stress responses and signalling 
processes. 

A key question raised by these findings 
is how the cell differentiates between dam- 
age and intentional, stable cellular markers. 
When is m'A a ‘bug; and when is it a useful 
feature? Methylation of adenine at position 1 
is well documented to arise from exposure to 
chemical methylating agents’ (Fig. 1), and thus 
1-methyladenine has been widely studied as 
a form of damage in DNA and RNA. Atleast 
one enzyme (ALKBH3) has been documented 
to remove this lesion and reduce its cytotoxic- 
ity’’. A recent study that also documents m'A 
in non-damaged mRNAs shows explicitly that 
this ‘repair’ enzyme can remove much of this 
methyl mark”. It will be important to deter- 
mine how the stable m'A marks recognized as 
being distinct from that resulting from alkyla- 
tion damage, and thus protected from imme- 
diate repair. And if m‘A is truly dynamic, in 
the sense of being specifically placed and 
removed from existing RNA asa cellular switch, 
then what methylase enzyme puts the mark 
in place, and what enzymes — ALKBH3 or 
other — remove it? 

Dominissini and colleagues’ findings repre- 
sent an intriguing step for epitranscriptomics. 
More work is needed to understand the mech- 
anism by which m'A influences translation ini- 
tiation and regulation, as well as the changes 
in its levels in stress responses. It will be excit- 
ing to see how this modification affects, and is 
affected by, RNA structure. Moreover, we need 
to know about the hypothesized proteins that 
are ‘writers, ‘readers’ and ‘erasers’ of m'A and 
thus take part in the dynamics of its regula- 
tion. Finally, as yet another modified base has 
been identified in mRNAs, one wonders how 
many more exist: is this the final one, or are 
we just seeing the tip of the epitranscriptome 
iceberg? = 
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PALAEOANTHROPOLOGY 


What teeth tell us 


Models based on developmental mechanisms described in mice and shared by 
most mammals are shown to accurately predict tooth size in extinct hominins, 
and can explain the small third molars in our species. See Letter p.477 


AIDA GOMEZ-ROBLES 


he third molars of modern humans — 

those often-troublesome molars we 

sometimes call wisdom teeth — are 
frequently very small or do not even develop. 
By contrast, the third molars of other hominin 
species in our evolutionary tree were huge, with 
chewing surfaces that were two to four times 
larger than those in an average modern human. 
This profound change in tooth proportions 
is usually explained by dietary and cultural 
changes considered to be unique to our species. 
But on page 477 of this issue, Evans et al.' offer 
an explanation that may make us feel rather less 
special. The authors propose that the evolution 
of our small third molars may be explained by 
basic developmental mechanisms that we share 
with most mammals. 

In 2007, researchers proposed an inhibitory- 
cascade model of dental development on the 
basis of experimental studies in mice”. Accord- 
ing to this model, dental proportions are estab- 
lished by the relative amount of inhibitory and 
activatory molecules that are expressed as one 
tooth develops after another (the greater the 
net inhibition exerted by an earlier tooth, 
the smaller the size of the later-developing 
tooth). Since then, researchers have explored 
the extent to which this model can explain 
tooth proportions in other mammalian spe- 
cies. Evans and colleagues, alongside others’, 
have extended the application of this model to 
hominins. 

Evans et al. focused on the lower primary 
postcanine teeth (lower milk molars and 
permanent molars). Their results show that 
variation in tooth size and proportions fol- 
lows a remarkably simple rule that differs 
slightly between two major hominin groups. 
In australopiths, the group of African early 
hominins that includes, in this study, the 
genera Ardipithecus, Australopithecus and 
Paranthropus, teeth tend to get bigger towards 
the back of the primary molar row (up to 
the second or third permanent molar), with 
proportions that are constant irrespective 
of overall dental size. However, in the Homo 
genus, the proportional size of teeth varies 
with their overall size: smaller dentitions have 
disproportionately small third molars and 
tend to decrease in size from the first to the 
third molar. 

The beauty of this model is that these simple 
relationships have an astonishing predictive 


power: by knowing only the size of a single 
tooth and the group to which it belongs — 
australopith or Homo — it is possible to infer 
with considerable accuracy the size of the 
remaining primary teeth. The model does 
not always correctly predict the location of 
the largest tooth, but predicted molar sizes are 
impressively close to observed values (Fig. 1). 
Dental anthropologists will be happy about 
this finding, but why should others care? 

Size and proportions are but one aspect of 
the variation in dental anatomy, which can be 
studied using morphometric traits (measur- 
able size and shape) and non-metric traits 
(such as the presence or absence of particular 
cusps and crests). Together with cranial fea- 
tures, dental traits are the bread and butter of 
hominin evolutionary studies, because they 
are thought to contain genetically coded infor- 
mation from which species relationships can 
be reconstructed. However, it is increasingly 
recognized that dental anatomy can vary in 
complex scenarios in which natural selection 
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interacts with developmental and functional 
constraints*; this complexity may make teeth 
less useful for inferring phylogenies than 
we tend to recognize. For example, it is still 
debated whether the megadontia (extremely 
large dental size) observed in the genus Paran- 
thropus is indicative of the evolutionary cohe- 
sion of this group, as is usually assumed, or 
whether it represents a convergent trend that 
was driven by similar ecological and develop- 
mental contexts’. 

At the other extreme of the dental-size spec- 
trum, fossils from the Sima de los Huesos site 
in Atapuerca, Spain, which are dated to more 
than 400,000 years old and are considered to 
be closely related to Neanderthals®, show an 
extreme degree of third-molar reduction’. 
The small third molars of these hominins 
are surprising for a taxon that is ancestral to 
classic Neanderthals, who had larger third 
molars. However, Evans and colleagues’ 
model exactly predicts the strong third-molar 
reduction of Sima de los Huesos hominins as 
a result of their small overall postcanine dental 
size’ (Fig. 1). Itremains to be understood why 
these hominins had such small teeth without 
an equivalent reduction of their faces and jaws. 
It is to be hoped that future experimental work 
will help to unveil models of the interaction 
between teeth and jaws that will explain this 
apparent paradox. 

Because of the paucity of hominin fossils, 
Evans and colleagues simplified their analy- 
sis by using species mean values. However, 
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Figure 1 | Predicting molar size. Evans et al.' present a model that can use the measurement ofa single 
tooth ofa hominin (humans and other members of the human clade since the split from chimpanzees) 
to predict the sizes of its other lower primary postcanine teeth (milk and permanent molars). Depicted 
is a to-scale comparison of observed and predicted permanent-molar sizes in Paranthropus robustus, 
hominins from the Sima de los Huesos site, Neanderthals and modern humans (m1, first molar; 

m2, second molar; m3, third molar). Observed values are from refs 8 and 10, with updated values for 
Sima de los Huesos from ref. 11; predicted values were calculated from the observed m1 size using Evans 
and colleagues’ model (see Supplementary Spreadsheet 1). Following their conventions, red is used for 
australopith hominins and blue for members of the Homo genus; the largest tooth in each series is filled 
in black, along with all teeth that are more than 95% of its size. (Tooth drawings are of modern human 
molars, and not to scale with boxes.) 
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activation and inhibition levels are expected 
to be individual-specific, which may result in 
tooth proportions that differ between indi- 
viduals of the same species. It is therefore pos- 
sible that the rule they describe would not be 
so simple had interindividual variation been 
included in the model. 

Despite this limitation, Evans and col- 
leagues’ paper advances palaeoanthropology 
in three fundamental ways. First, it draws on 
experimental data from mice, which are the 
most common model organism, to explain 
the variation observed in the hominin fossil 
record. Second, it develops a rigorous quanti- 
tative framework to formally test hypotheses 
related to that model. And finally, it improves 
our understanding of the human fossil record 
by identifying evolutionary changes that are 
developmentally linked. 

More importantly, Evans and colleagues’ 
results are relevant beyond the study of fossil 
teeth. Many of the developmental constraints 
that influence dental evolution are shared 
by other systems formed by the repetition 
of serially homologous components, such as 
vertebrae, ribs, limbs and digits. Teeth can 
therefore be useful in identifying develop- 
mental mechanisms operating in these other 
systems’. By extension, the authors’ model has 
the potential to help us understand the evolu- 
tion of human traits that are associated with 
serially homologous structures, including our 
upright posture (which is influenced by ver- 
tebral anatomy), bipedal locomotion (which 
is linked to limb anatomy) and precision grip 
(which depends on the anatomy of digits). As 
with third-molar reduction, we tend to con- 
sider these traits the result of human-specific 
selective pressures, but their evolution is also 
fundamentally channelled by general develop- 
mental rules that humans have not escaped. = 
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CLIMATE SCIENCE 


Hidden trends in the 
ocean carbon sink 


Simulations of the flux of atmospheric carbon dioxide into the ocean show that 
changes in flux associated with human activities are currently masked by natural 
climate variations, but will be evident in the near future. SEE LETTER P.469 


TATIANA ILYINA 


he world ocean has absorbed about 

one-third of the carbon released by 

humans’, and therefore has a key role 
in moderating climate change. Observations’ 
of the ocean interior confirm that increases 
in carbon dioxide emissions from fossil-fuel 
burning are accompanied by an increase in 
carbon content in the upper ocean. But, sur- 
prising as it may seem, McKinley et al.* report 
on page 469 that, in many ocean regions, 
changes in the uptake of CO, induced by 
human activities are currently indistinguish- 
able from changes driven by natural climate 
variations. So, are anthropogenic trends in the 
ocean carbon sink concealed by Earth’s own 
variability? 

As atmospheric CO, levels increase, the 
ocean takes up this gas at a rate proportional to 
the difference of the partial pressure of CO, (a 
measure of CO, concentration in a mixture of 


35) 


gases) between the air and sea’. The strength of 
the ocean carbon sinkis determined by chemi- 
cal reactions in seawater, biological processes 
such as photosynthesis and respiration, and 
physical processes, including ocean circulation 
and vertical mixing’. But even though these 
key mechanisms are known, there are con- 
siderable uncertainties regarding their year- 
to-year (interannual) and decadal variations’. 
These variations are tightly linked to modes 
of internal variability in the climate system — 
such as the El Nifio-Southern Oscillation 
(ENSO) — that have regional to worldwide 
effects on weather and climate, and thereby 
modulate air-sea CO, fluxes and ocean 
biogeochemical cycles. 

Advances in observations and models have 
shed light on how internal climate variability 
controls the ocean carbon sink. Modern Earth- 
system models (ESMs) that were analysed in 
the fifth Coupled Model Intercomparison Pro- 
ject (CMIP5, which compared the output of 
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Figure 1 | Fluxes of carbon dioxide from the atmosphere into the ocean. These graphs compare the 
temporal evolution of the mean annual flux of CO, into the world ocean (in petagrams of carbon per 

year; 1 Pgis 10° grams) predicted by different model simulations, and from observations. Although the 
models and observations reveal a similar trajectory for the global change of flux, the plotted values vary 
considerably because of uncertainties in models and gaps in observations. The model predictions are from 
the fifth Coupled Model Intercomparison Project (CMIP5; blue) and from an ensemble of 100 simulations 
made using the Max Planck Institute Earth System Model (MPI-ESM; red). Solid lines show the average 
of an ensemble of model data; shaded areas show the upper and the lower boundaries of the ensemble 
data. Findings based on observational data’ are in black. McKinley et al.” report simulations suggesting 
that anthropogenic changes in the air-sea carbon flux are currently obscured by naturally occurring flux 
variations. (CO, fluxes into the ocean in CMIP5 models and in the MPI-ESM were calculated by Hongmei Li.) 
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models in which components of Earth's system 
are coupled) consistently predict that ocean 
carbon uptake increases in line with rises in 
fossil-fuel carbon emissions. But these ESMs 
poorly capture natural climate variability. Fur- 
thermore, CMIP5 projections generate a large 
spread in estimates of the ocean carbon sink 
because the models have different numeri- 
cal schemes, process descriptions and spatial 
resolutions®. 

Further complications arise because there 
is a large spread in observed variations of CO, 
fluxes, due to both the use of different map- 
ping procedures and gaps in the observational 
record’. This spread of variation from observa- 
tional data is at least as large as the spread from 
the CMIP5 ensemble of models (Fig. 1). Esti- 
mates of the internal variability of the ocean 
carbon sink therefore remain unconstrained, 
impeding the detection and attribution of 
changes in air-sea CO, fluxes. 

McKinley and co-workers report a crucial 
step towards detecting changes in the ocean 
carbon sink — they have generated a large 
ensemble of simulations based on an ESM. The 
authors repeated their model runs 32 times 
over the same time interval, spanning the years 
1920 to 2100. The model follows historical 
climate evolution until the end of 2005, and 
then a climate-change scenario (known as 
RCP8.5) that projects high levels of atmos- 
pheric CO,. Few modelling centres are able to 
perform so many runs because doing so is com- 
putationally expensive, but the value of using 
ensembles with a high number of simulations 
is increasingly being recognized. 

External forcing factors such as the concen- 
trations of atmospheric greenhouse gases and 
aerosols, volcanic eruptions and solar variabil- 
ity were identical across all simulations. The 
only difference between the ensemble mem- 
bers was their climate state at initialization: 
each simulation started with a different state 
generated by a small perturbation of the air 
temperature. As a result, individual ensemble 
members were not identical to each other, even 
though the model variables for each member 
followed the same general trajectory. 

The authors considered two outcomes from 
the simulations: the forced trend, which is the 
average trend in model variables across all 
ensemble members produced under specified 
external forcing, and the internal trend, which 
is the difference between each member's trend 
and the forced trend, caused by the model’s 
internal variability. They observed that the 
forced trends in ocean carbon uptake are indis- 
tinguishable from internal model variability 
in vast ocean regions between 1990 and 1999. 
When the period between 1990 and 2019 is 
considered, the forced trends become statis- 
tically significant in many more areas than 
during 1990 to 1999, and emerge almost 
everywhere in the ocean. This suggests that the 
predicted increase in oceanic carbon uptake is 
attributable to anthropogenic forcing. These 


trends intensify as atmospheric CO, levels 
increase, and so become detectable ocean- 
wide when the period from 1990 to 2089 is 
considered. 

The researchers confirm that the spatial 
pattern in which trends emerge seems to be 
closely linked to the internal variability of 
the climate system. For instance, the largest 
internal variations are in the equatorial 
Pacific Ocean, a region known to be affected 
by ENSO. In regions such as this that exhibit 
strong seasonal and interannual variability, it 
is difficult to detect anthropogenic changes in 
CO, uptake. By contrast, anthropogenic trends 
emerge early in the subpolar North Atlantic 
and equatorial Atlantic oceans and in some 
regions of the Southern Ocean. A similar 
pattern has been reported* for several ocean 
biogeochemical parameters modelled in the 
CMIP5 ensemble. 

McKinley and colleagues conclude that 
forced trends should be detectable in obser- 
vations once they have emerged and become 
statistically significant. However, although 
internal model variability can give an indica- 
tion of the chaotic behaviour of some natural 
processes, it is not equivalent to Earth’s natural 
climate variability. Furthermore, predictions of 
the time of emergence are model-dependent*”, 
and may change ifa different model or ensem- 
ble size is considered (Fig. 1). 
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Nevertheless, the current study makes a 
valuable contribution to the quantification of 
internal variability in the rates of change of the 
ocean carbon sink. Changes driven by human 
activities are undoubtedly there, but may 
be concealed by natural variations in many 
ocean regions because of the slow timescales 
on which ocean processes occur. Future work 
based on coordinated observational frame- 
works and large ensemble simulations using 
ESMs should enable the natural variability 
reported in this study to be verified. m 
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fast radio burst 


Our understanding of fast radio bursts — intense pulses of radio waves — and 
their use as cosmic probes promises to be transformed now that one burst has 
been associated with a galaxy of known distance from Earth. SEE LETTER P.453 


DUNCAN LORIMER 


of radio waves that last only milliseconds 

and originate from random locations on 
the sky. Although the physical processes that 
cause these pulses are unknown, FRBs hold 
great promise as probes of the cosmic web, a 
major constituent of the large-scale structure of 
the Universe. On page 453 of this issue, Keane 
et al.' report the discovery of a fading radio 
source associated with an FRB (FRB 150418) 
that was found on 18 April 2015. From obser- 
vations with various telescopes, they show 
that FRB 150418 and its fading counterpart 
come from an elliptical galaxy some 1.9 bil- 
lion parsecs (6 billion light years) away. Just as 
happened with y-ray bursts almost 20 years 
ago, knowing both the location of origin and 


f ast radio bursts (FRBs) are bright pulses 


the distance of that location from Earth will 
help to reveal the physical nature of the event 
that caused the FRB. 

A group that I led discovered’ the first FRB 
while searching archival data taken with the 
64-metre Parkes radio telescope in New South 
Wales, Australia. That burst, now known 
as FRB 010724, showed a clear dependence 
of pulse arrival time on frequency, with the 
highest-frequency signals travelling fastest 
through the ionized interstellar medium and 
arriving earlier than their lower-frequency 
counterparts. This well-known dispersion 
effect has allowed astronomers to map out the 
ionized material in the interstellar medium, 
and has helped us to understand the dis- 
tribution of matter in the Milky Way’. For 
FRB 010724, however, the amount of disper- 
sion observed is roughly ten times greater than 
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Host galaxy 


Figure 1 | Redshift of a fast radio burst (FRB). Light increases in wavelength (redshifts) as it travels 


across long distances. a, Here, light leaves a galaxy with an emitted wavelength A, 


The distance between 


em* 


Earth and the galaxy at this time is d.,,. b, When the signal subsequently reaches Earth, the expansion of the 


Universe will have caused the wavelength to redshift by a factor d,,,/d, 


where d,,, is the current distance 


em? 


to the galaxy; the new wavelength is J,,,. From a measurement of ,,, of the host galaxy for FRB 150418 (a 
bright pulse of radio waves that lasted only milliseconds), Keane et al.' determine its redshift and conclude 
that it was emitted when the Universe was about two-thirds of its current size (6 billion years ago). 


that produced by the Milky Way, indicating 
that it is probably of extragalactic origin. Using 
models of intergalactic dispersion, we specu- 
lated’ that FRB 010724 originated around 
0.5 billion parsecs away, and was the first of 
a new population of sources that could act as 
cosmological probes. 

On the basis of the amount of sky coverage 
and time spent by various radio telescopes 
scanning the sky, thousands of FRBs could be 
visible each day’. However, the limited fields 
of view of most radio telescopes mean that 
the pace of subsequent discoveries has been 
less rapid than expected. Currently, 17 FRBs 
are known’, including FRB 150418, but none 
of the distances travelled by them has been 
directly measured. The energy budget of 
FRBs is therefore not well known, and there 
are currently more ideas about the nature of 
FRB sources than there are FRBs themselves. 
Among the possible theories’ are that they are 
collapsing neutron stars, evaporating black 
holes or even non-stellar phenomena such as 
cosmic strings. 

Unlike the FRB discoveries made in archival 
data, FRB 150418 was detected seconds after 
it reached the telescope. Keane et al. could 
therefore carry out follow-up observations of 
the burst’s location on the sky using telescopes 
spanning the electromagnetic spectrum. 
Although almost all of these later observations 
yielded null results, one source was found in 
multiple images obtained at frequencies of 
5.5 and 7.5 gigahertz using the Australia Tele- 
scope Compact Array. This transient radio 
source faded over six days to a level consistent 
with radio emission from a distant galaxy. 

The authors’ statistical analysis shows that 
there is only a tiny probability (less than 0.2%) 
that this source is unrelated to FRB 150418. 
Observations with the Subaru telescope on 
Mauna Kea, Hawaii, revealed an elliptical 
galaxy with a spectrum of optical light that 


originated predominantly from very old 
stars in the Galaxy. By analysing the redshift 
of these lines (the increase in wavelength 
due to the expansion of the Universe), the 
authors conclude that the distance to the galaxy 
and the source of FRB 150418 is 1.8 billion par- 
secs (Fig. 1). 

Knowing the source location of an FRB and 
its distance from Earth provides new con- 
straints on the FRB’s emission process. Keane 
et al. find that the total energy released by the 
FRB is 8 x 10*' joules. This is comparable to the 
energy released by the Sun in two days, despite 
the much shorter duration of FRB 150418. 
The measured pulse width of 0.8 ms is only an 
upper bound on the intrinsic pulse duration, 
because the pulse broadens as it travels the long 
distance through different amounts of ionized 
material along the line of sight. As a result, the 
luminosity of the FRB is at least 10*° watts, or 
about one billion times that of the Sun. 

These properties, together with the fading 
counterpart source and the FRB’s location in 
an elliptical galaxy, strongly suggest models of 
the nature of the source population that involve 
old stellar populations found in elliptical galax- 
ies, rather than much younger spiral galaxies 
such as the Milky Way. Keane et al. argue that 
FRB 150418 originated from the merger of two 
compact stars. I speculate that it represents 
the electromagnetic emission released when a 
binary neutron-star system coalesces’. Such a 
system would emit a large fraction ofits energy 
in the form of gravitational waves, which are 
produced by accelerating bodies, according to 
Einstein’s general theory of relativity. 

Because gravitational waves result from 
the motion of objects, they carry information 
that is complementary to that conveyed by 
electromagnetic waves (light). The search for 
gravitational waves is a major goal of the Laser 
Interferometer Gravitational- Wave Obser- 
vatory® (LIGO), which has just announced’ 
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the first detection of waves produced by the 
inspiralling and merger of two black holes. 
Although the source of FRB 150418 is too 
distant to be detectable by LIGO, closer 
FRB sources might be discovered in the future 
that could emit detectable gravitational-wave 
signals. 

Perhaps the most important of Keane and 
co-workers’ results is their demonstration 
that FRB 150418 can be used as a cosmologi- 
cal probe. The measurement of the pulse’s 
dispersion (the frequency-dependent delay 
of the incoming radio waves) and the source's 
redshift provide an exact count of the num- 
ber of free electrons along the line of sight 
stretching back in cosmic time to the source 
of FRB 150418. The authors show that the 
density of ionized material solves the ‘missing 
baryon’ problem’? — the fact that the amount 
of baryonic (non-dark) matter in galaxies 
is well below that required to explain star 
formation rates, even after accounting for 
dark matter and dark energy. The measure- 
ment provided by FRB 150418 shows, for the 
first time, that the baryonic density is entirely 
consistent with the predictions from standard 
cosmological models. 

There are good reasons to think that 
our understanding of FRBs will increase 
dramatically in the near future. New tele- 
scopes, including the Canadian Hydrogen 
Intensity Mapping Experiment and the Five- 
hundred-meter Aperture Spherical Telescope 
in China, should discover many FRBs. It is rea- 
sonable to expect many hundreds of observed 
FRBs by the end of the decade. A large census 
of FRBs will not only add to our understand- 
ing of their population, but also map out the 
cosmic web in great detail’’, provide stringent 
tests of general relativity’” and even yield new 
constraints on the nature of dark energy”*. = 
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Ancient gene flow from early modern 
humans into Eastern Neanderthals 


Martin Kuhlwilm!*, [lan Gronau2*, Melissa J. Hubisz*, Cesare de Filippo!, Javier Prado-Martinez‘*, Martin Kircher!, 
Qiaomei Ful®’, Hernan A. Burbano!®, Carles Lalueza-Fox‘, Marco de la Rasilla’, Antonio Rosas!’, Pavao Rudan"!, 
Dejana Brajkovic!’, Zeljko Kucan", Ivan Gusic!!, Tomas Marques-Bonet*!*:!4, Aida M. Andrés!, Bence Viola!®", 
Svante Paabo!, Matthias Meyer!, Adam Siepel*” & Sergi Castellano! 


It has been shown that Neanderthals contributed genetically to modern humans outside Africa 47,000-65,000 years ago. 
Here we analyse the genomes of a Neanderthal and a Denisovan from the Altai Mountains in Siberia together with the 
sequences of chromosome 21 of two Neanderthals from Spain and Croatia. We find that a population that diverged early 
from other modern humans in Africa contributed genetically to the ancestors of Neanderthals from the Altai Mountains 
roughly 100,000 years ago. By contrast, we do not detect such a genetic contribution in the Denisovan or the two European 
Neanderthals. We conclude that in addition to later interbreeding events, the ancestors of Neanderthals from the Altai 
Mountains and early modern humans met and interbred, possibly in the Near East, many thousands of years earlier than 


previously thought. 


Based on the fossil record, Neanderthals diverged from modern 
humans at least 430,000 years ago', and the analysis of a Neanderthal 
genome from a cave in the Altai Mountains in Siberia suggests they 
diverged 550,000-765,000 years ago’. The analysis of a Denisovan 
genome from the same cave in the Altai Mountains further suggests 
that Neanderthals and Denisovans diverged 381,000-473,000 years 
ago”. This divergence was followed by admixture among archaic and 
modern human populations, including gene flow from Neanderthals 
into modern humans outside Africa?~*, Denisovan gene flow into the 
ancestors of present-day humans in Oceania and mainland Asia®’, gene 
flow into the Denisovans from Neanderthals? and, possibly, gene flow 
into the Denisovans from an unknown archaic group that diverged 
from the other lineages more than one million years ago”. Genetic 
evidence of gene flow from modern humans into Neanderthals or 
Denisovans, however, remains elusive. 


Divergence and heterozygosity in the archaic genomes 
The Altai Neanderthal genome shares 5.4% more derived alleles with 
present-day Africans than does the Denisovan genome. This excess is 
particularly pronounced for derived alleles found at >0.9 frequency 
in Africans (Extended Data Table 1). These observations have been 
interpreted as evidence of gene flow from an unknown and more deeply 
diverged archaic hominin into the Denisovan lineage”. Here we exam- 
ine whether gene flow from modern humans into the ancestors of the 
Altai Neanderthal may also have occurred. 

Noting that regions in the Denisovan genome introgressed from a 
deeply divergent archaic hominin should have unusually high diver- 
gence to present-day Africans, and that regions of the Altai Neanderthal 
genome introgressed from modern humans should have unusually 


low divergence to them, we examined the divergence of these archaic 
genomes to 504 African genomes® in 15,881 sequence windows of 
100 kb (Supplementary Information section 9). Archaic alleles brought 
into Africa by Eurasians about 3,000 years ago!” were excluded from 
these windows by using only derived alleles at >0.9 frequency in the 
combined African genomes. In the absence of information about the 
phase of the alleles in the two archaic genomes, we calculated their 
divergence to Africans using the archaic alleles in each window that give 
the minimum number of differences, to allow introgressed segments 
from modern humans to be more easily identified, if they exist. Noting 
also that introgressed regions in the Denisovan or Altai Neanderthal 
genome should have unusually high divergence to the other archaic 
genome, we calculated the divergence between the archaic genomes in 
the same windows by using the alleles that give the maximum number 
of differences. 

We find that windows of the Denisovan genome with high diver- 
gence to Africans also have a high divergence to the Altai Neanderthal, 
whereas windows in the Altai Neanderthal genome with high diver- 
gence to Africans do not tend to have a high divergence to the 
Denisovan (Fig. 1a), consistent with gene flow from a deeply diverged 
hominin into the Denisovan ancestors. On the other hand, we find 
that windows of the Altai Neanderthal genome with low divergence 
to Africans have higher divergence to the Denisovan than Denisovan 
windows with low divergence to Africans (Fig. 1a). These windows 
in the Altai Neanderthal genome have higher heterozygosity than in 
the Denisovan genome (Fig. 1b), and 40.7% of their heterozygous 
sites share a derived allele with Africans, whereas 24.2% do so in the 
Denisovan. These observations raise the possibility of gene flow from 
modern humans into Neanderthals. 
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Figure 1 | Divergence and heterozygosity in the Altai Neanderthal and 
Denisovan genomes. a, The maximum divergence between windows in 
the two archaic genomes versus their minimum divergence to Africans. 
Error bars represent the 95% confidence intervals from 1,000 bootstrap 
replicates. Regions previously described as inbred in the Altai Neanderthal 
genome’ were excluded. b, Heterozygosity (per 1,000 bp) in windows of 
each archaic genome versus their minimum divergence to Africans. 

c, d, Simulation of a model with gene flow into the Denisovan lineage 
from both the Altai Neanderthal (0.65%, 50,000 years ago) and an 
unknown archaic hominin (1%, 200,000 years ago) that diverged from 
other hominins 1.5 million years ago. The constant mutation rate used 
makes the slope of the simulated curves less steep than in the actual 
genomes, where mutation rate varies among windows. e, f, Simulation 

of a model that also includes modern human gene flow into the Altai 
Neanderthal lineage (3.55%, 100,000 years ago). 


Model-based inferences of gene flow 

We assessed the possibility of modern human gene flow into the 
Altai Neanderthal lineage using the Generalized Phylogenetic 
Coalescent Sampler (G-PhoCS)!", a Bayesian method for inferring 
divergence times, effective population sizes and rates of gene flow 
from genome sequences. We applied G-PhoCS in five separate anal- 
yses, each considering the Altai Neanderthal and Denisovan genomes 
and two present-day human genomes from an African, European 
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or Asian population (Supplementary Information section 8). We 
modelled gene flow among modern and archaic populations, includ- 
ing gene flow from an unknown deeply divergent archaic popula- 
tion, while accounting for the uncertainty in the ages of the archaic 
individuals. 

The inferred demographic model confirms and provides quantitative 
estimates of previously inferred gene flow events among modern and 
archaic humans”* (Extended Data Fig. 1). These include Neanderthal 
gene flow into modern humans outside Africa (3.3-5.8%) and 
gene flow from an unknown archaic hominin into the ancestors of 
Denisovans (0.0-0.5%). Interestingly, we also detect a signal of gene 
flow from modern humans into the ancestors of the Altai Neanderthal 
(1.0-7.1%). The precise source of this gene flow is unclear, but it 
appears to come from a population that either split from the ancestors 
of all present-day Africans or from one of the early African lineages, 
as significant admixture rates are estimated from San as well as Yoruba 
individuals. This introgression thus occurred in the opposite direction 
from the previously reported gene flow from Neanderthals to modern 
humans outside Africa”. 


Simulation of modern human gene flow 

We used simulations to test if G-PhoCS correctly infers modern human 
gene flow into the Altai Neanderthal lineage (Extended Data Figs 2 and 3) 
and whether the patterns of divergence and heterozygosity observed in 
the Altai Neanderthal genome are expected from our inferred demo- 
graphic model (Extended Data Fig. 1). Using parameters compatible 
with this model, we simulated windows of 100 kb for a model with 
gene flow into the Denisovan lineage from both the Altai Neanderthal 
and a deeply divergent archaic hominin’, and a model including these 
admixture events together with modern human gene flow into the Altai 
Neanderthal lineage. Both models reproduced the observed patterns 
in windows most divergent to Africans (Fig. 1c and d), but only the 
model with modern human gene flow into the ancestors of the Altai 
Neanderthal reproduced the observed divergence and heterozygosity 
patterns in windows of the Altai Neanderthal least divergent to Africans 
(Fig. le and f). 

Present-day human contamination among the DNA fragments 
from the Altai Neanderthal and Denisovan is around 1% (Table 1). 
After genotype calling, which is unaffected by low levels of error, 
these genomes should be largely free from contamination”’. Even 
so, substituting gene flow from modern humans for present-day 
human contamination as high as 5% in the genotypes of the 
Altai Neanderthal fails to explain the observed sequence patterns 
(Extended Data Fig. 4). 


Estimated ages of the introgressed haplotypes 
The majority of haplotypes shared between present-day humans and 
an archaic genome should result from incomplete lineage sorting in 
the population ancestral to them and, thus, be old and short. However, 
if modern human introgression into the Altai Neanderthal lineage 
occurred after its separation from the Denisovan lineage we would 
expect a fraction of these shared haplotypes to be younger and longer 
in the Altai Neanderthal than in the Denisovan genome. 

We examined these shared haplotypes making use of ARGweaver’?, 
a new computational method for sampling full genealogies and cor- 
responding recombination events (ancestral recombination graphs) 
consistent with a collection of genome sequences (Supplementary 
Information section 10). We applied this method to six African 
genomes from three different populations (San, Mbuti, and Yoruba) 
and the two archaic genomes, and estimated the ages of haplotypes for 
which one archaic genome coalesces within the subtree of the African 
genomes more recently than it coalesces with the other archaic genome 
(Fig. 2a, inset). When we compare the age distribution of such ‘African’ 
haplotypes (>50kb), we find that the Altai Neanderthal genome has 
more young ‘African’ haplotypes (Fig. 2a, left) than the Denisovan 
genome (P < 0.01; fraction of MCMC replicates). The majority of 
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Table 1 | The archaic individuals analysed in this work 


Altai El Sidrén Vindija 

Neanderthal? Denisovan’? Neanderthal Neanderthal 
Age >50,000 >50,000 ~49,000 ~44,000 
(years old) 
mtDNA 0.78 0.35 0.40 1.08 
contamination (%) 
Nuclear 0.80 0.22 0.000023 1.12 
contamination (%) 
Genome 
Average coverage 52.7-fold 30.9-fold - - 
Heterozygosity 0.19 0.22 - - 
(per kb) 
Chromosome 21 
DNA enrichment - - 320-fold = 120-fold 
Average coverage 53.7-fold 31.1-fold 14.1-fold 35.9-fold 
Heterozygosity 0.13 0.21 0.24 0.26 
(per kb) 
Cumulative length 10-100 kb 9.68 22.60 20.50 20.50 
of homozygous >100 kb 19 4.80 5.10 5.10 
segments (Mb) 


Radiocarbon dates (uncalibrated), mean contamination estimates for the DNA fragments 
sequenced and summary statistics for the genomes and chromosome 21 sequences. mtDNA, 
mitochondrial DNA. 


these young haplotypes are estimated to coalesce with the African 
genomes 100,000-230,000 years ago, suggesting that they entered 
into the ancestors of the Altai Neanderthal well before the reported 
gene flow from Neanderthals into modern humans outside Africa 
47,000-65,000 years ago’*. Both the cumulative and average length 
of the young ‘Africa’ haplotypes is longer in the Altai Neanderthal 
genome than in the Denisovan genome. 

The introgression from a deeply divergent archaic population 
into the Denisovan lineage is a potential confounding factor in this 
analysis. However, this introgression event should affect older haplo- 
types in the Denisovan genome, rather than the young haplotypes 
examined above. Indeed, we find that the number of haplotypes in 
one archaic genome that coalesce outside Africans and the other 
archaic genome (Fig. 2b, inset) is higher in the Denisovan than in the 
Altai Neanderthal (Fig. 2b, right). Furthermore, the young ‘Africa’ 
haplotypes in the Altai Neanderthal genome do not significantly 
overlap with the older haplotypes in the Denisovan genome and in 
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Figure 2 | Distinguishing between two scenarios of introgression into 
archaic humans. a, The age distribution of ‘African’ haplotypes (>50 kb) 
in the Altai Neanderthal and the Denisovan genomes as inferred by 
ARGweaver. Error bars represent the 95% credible intervals from 302 
Markov chain Monte Carlo (MCMC) replicates. An ‘African’ haplotype 
coalesces within the African subtree before coalescing with the other 
archaic individual (inset), and its age is inferred as that coalescent time 
(arrowhead). The majority of the young ‘Africa’ haplotypes in the Altai 
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simulations ARGweaver only infers them under a model with mod- 
ern human gene flow into the Altai Neanderthal lineage (Extended 
Data Fig. 5). 


Inference of gene flow in European Neanderthals 

To investigate possible differences among Neanderthal populations 
with respect to introgression from modern humans, we designed oligo- 
nucleotide probes" based on the human reference sequence of chromo- 
some 21, and used them to capture’* this chromosome in a Neanderthal 
from Spain (El Sidrén Cave) and a Neanderthal from Croatia (Vindija 
Cave). We estimated their present-day human contamination to be 
around 1% (Table 1). 

We find that the chromosome 21 of the Altai Neanderthal shares 
more derived alleles with Africans than the chromosome 21 of El 
Sidrén (3.5% more) and Vindija (4.9% more) Neanderthals, with the 
European Neanderthals sharing more derived alleles with Africans 
than the chromosome 21 of the Denisovan (9.8% more for El Sidrén, 
8.8% more for Vindija). A comparison of the distribution of haplotype 
ages is not possible with the European Neanderthals, owing to insuf- 
ficient amounts of data, but we compared the cumulative length of 
haplotypes coalescing within the African subtree for each Neanderthal 
lineage. This length is significantly greater for the Altai Neanderthal 
than for the European Neanderthals (P< 0.01; fraction of MCMC rep- 
licates), consistent with introgression from modern humans primarily 
into this Neanderthal lineage. 

When we refine our estimates of gene flow by adding the chromo- 
some 21 sequences of the European Neanderthals to our genome-wide 
data, G-PhoCS infers significant rates of gene flow from Neanderthals 
into modern humans outside Africa only for El Sidrén and Vindija 
Neanderthals (0.3-2.6%) (Fig. 3a), suggesting that Neanderthals 
from Europe are more closely related than the Altai Neanderthal to 
the population that interbred with modern humans outside Africa 
47,000-65,000 years ago’*. Conversely, significant rates of gene flow 
from modern humans into Neanderthals are inferred only into the ances- 
tors of the Altai Neanderthal (0.1-2.1%) (Extended Data Figs 6 and 7). 
This suggests that modern human introgression into Neanderthals 
occurred mainly after the divergence of the Altai Neanderthal from 
El Sidron and Vindija lineages 110,000 (68,000-167,000) years ago 
(Fig. 3b). However, it is possible that the lack of complete genomes from 
the European Neanderthals currently precludes the identification of 
modern human gene flow into them. 

To explore the source of the modern human gene flow among 
the African populations, we simulated three scenarios in which the 
source of the gene flow into the Altai Neanderthal lineage was alter- 
nately an unknown population diverging from the ancestors of all 
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Neanderthal genome are estimated to coalesce 100,000-230,000 years 
ago, with just a few estimated to coalesce less than 100,000 years ago 
(Supplementary Information section 10). b, The age distribution of ‘deep 
ancestral’ haplotypes (>50 kb) in the Altai Neanderthal and Denisovan 
genomes. A ‘deep ancestral’ haplotype coalesces above the African 
subtree and the other archaic lineage (inset), and its age is inferred as that 
coalescent time (arrowhead). ky, thousand years. 
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Figure 3 | Refined demography of archaic and modern humans. a, Total 
migration rates of six gene flow events inferred by G-PhoCS. The ranges 
correspond to 95% Bayesian credible intervals aggregated across runs. Five 
gene flow events have been previously reported, including gene flow from 
an unknown archaic group into Denisovans (blue arrow). In addition, 

we infer gene flow from a population related to modern humans into a 
population ancestral to the Altai Neanderthal (red arrow). It appears to 


present-day Africans, of the San or of Yoruba lineage (Supplementary 
Information section 8). The G-PhoCS estimates from these three 
models are all similar and consistent with those in Fig. 3, and 
thus we cannot distinguish among them. However, it is clear 
that the source of the gene flow is a population equally related to 
present-day Africans and non-Africans (Extended Data Fig. 3). We 
conclude that the introgressing population diverged from other mod- 
ern human populations before or shortly after the split between the 
ancestors of San and other Africans (Fig. 3a), which occurred approx- 
imately 200,000 years ago'’. In agreement, the San, Mbuti and Yoruba 
genomes contribute equally to the young ‘African’ haplotypes in the 
Altai Neanderthal genome (Supplementary Information section 10). 


Introgressed segments in the Altai Neanderthal 

To shed light on possible functional implications of modern 
human gene flow into Neanderthals, we identified 163 putatively 
introgressed segments (>50 kb) in the Altai Neanderthal genome 
(Supplementary Information section 9). These segments have no 
clear affinity to any present-day African population (Extended Data 
Fig. 8), and they overlap with 225 genes. Seven segments exceed 
200 kb (Table 2) and the longest one (309 kb) overlaps with a region 
suspected to have been under positive selection in modern humans’. 
This region has a transcription factor gene (NR5A2) involved in liver 
development'®. One segment of 150kb is located within the FOXP2 
gene (Table 2), which encodes a transcription factor that may be 
relevant for language acquisition'’. 

The number of putatively introgressed segments in the Altai 
Neanderthal decreases in regions of the genome under strong purify- 
ing selection (measured via background selection at linked sites'®), and 
it is lower in the X chromosome compared to the autosomes. Because 
purifying selection purges deleterious alleles and the efficacy of puri- 
fying selection is higher on the X chromosome”’, this may indicate that 
modern human and Neanderthal” alleles were often not tolerated in 
each other’s genetic background. 


Population size in Neanderthals and Denisovans 

Our demographic model suggests a long-term decline in the effective 
population size of Neanderthals and Denisovans since their divergence 
from the ancestors of present-day humans 484,000-640,000 years ago. 
However, the population ancestral to the Vindija Neanderthal appears 
to have expanded (Fig. 3b). In addition, the length distribution of 
homozygous stretches in the European Neanderthals resembles that 
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come from a population that either split from the ancestors of present-day 
Africans or separated fairly early in the history of African populations 
(shaded circle). b, Effective population sizes and divergence times inferred 
by G-PhoCS. The ranges correspond to 95% Bayesian credible intervals 
aggregated across runs. The horizontal bars indicate posterior mean 
estimates for divergence times. Archaic samples (dots) are located at their 
estimated ages. 


of the Denisovan, who lacks a signal of recent inbreeding’, and not 
that of the Altai Neanderthal, whose parents were related at the level 
of half-siblings? (Table 1). Still, the European Neanderthals and the 
Denisovan exhibit signs of a history of mating in small populations”', 
with a larger cumulative length of homozygous segments of 10-100 kb 
than present-day humans and great apes (Fig. 4). In agreement with 
purifying selection being less efficient in small populations, regulatory 
and conserved” regions in Neanderthals have a larger proportion of 
putatively deleterious alleles than present-day humans (Extended Data 
Fig. 9), as shown previously for their protein-coding genes”. 


Discussion 

Our integrated demographic analysis of multiple archaic and present- 
day human genomes suggests a scenario of long-term decline in the 
populations of Neanderthals and Denisovans, with the consistently 
small Altai Neanderthal population perhaps reflecting a long period 
of isolation in the Altai Mountains. In addition, we provide evidence 


Table 2 | Introgressed segments from modern humans into the Altai 
Neanderthal 


Sequence Genetic 
length length 

Genomic region SNPs (bp) (cM) Genes in the region 
Chri: 199,707,795- 161 308,665 0.047 NR5A2; RNU6-609P; 
200,016,460 RNU6-716P; RNU6-778P 
Chr13: 49,532,446- 103 258421 0.040 COX7CP1; FNDC3A; 
49,790,867 OGFOD1P1; RAD17P2; 

RNU6-60P; RNY3P2 
Chr2: 88,815,371- 116 246606 0.023 EIF2AK3; RPIA; TEX37 
89,061,977 
Chr3: 89,790,776- 70 240,761 0.017 - 
90,031,537 
Chr3: 30,590,736- 100 226,070 0.547 GADL1; TGFBR2 
30,816,806 
Chr6: 42,492,777- 67 220446 0.088 ATPE6VOCP3; PRPH2; 
42,713,223 RNU6-890P; TBCC; UBR2 
Chr8: 93,809,505-— 122 201,829 0.070 IRF5P1; TRIQK 
94,011,334 
Chr7: 113,813,987— 37 149,597 0.055 FOXP2 
113,963,584 


The seven segments (>200 kb) in the Altai Neanderthal genome that are enriched in heterozy- 
gous sites with derived alleles at high-frequency in Africans. These sites are homozygous ances- 
tral in the Denisovan. The segment overlapping the FOXP2 gene is also shown. 
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Cumulative length of segments 10-100 kb (Mb) 
Figure 4 | Homozygous segments on chromosome 21. The range of the 
cumulative length (Mb) of homozygous segments is shown as the surface 
of a polygon, with individuals at the extremes of each group’s range serving 
as vertices. Dots represent human individuals, archaic or otherwise, 
whereas great apes are not depicted individually. The Altai Neanderthal 
clusters with the other archaic individuals (inset) when recently inbred 
genomic regions are excluded. 


for modern human introgression into the ancestors of this population 
of Neanderthals, and no such evidence in the European Neanderthals. 
These modern humans may represent a population that diverged early 
from other modern humans in Africa and later met the ancestors of 
the Altai Neanderthal. The finding of ‘African’ haplotypes as young as 
100,000 years old in the Altai Neanderthal genome is consistent with 
interbreeding around that age. 

Hublin™ has proposed that Neanderthals expanded eastward from 
Europe during an interglacial period about 125,000 years ago (Oxygen 
Isotope Stage 5e). The presence of modern humans (at Skhul and Qafzeh) 
and Neanderthals (at Tabun) in the Levant as early as 120,000 years 
ago”>”® provides one place where gene flow from early modern humans 
into Neanderthals could have occurred. Another place is Southern 
Arabia and the area around the Persian Gulf, where modern humans 
may have also settled early”” and Neanderthals are likely to have been 
present*®. The recent demonstration that modern humans may have 
been in China as early as 120,000 years ago” also suggests that modern 
humans migrated early out of Africa. Thus, early modern humans may 
have had the opportunity to admix with archaic hominins before the 
migration of the modern human ancestors of present-day non- Africans. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

DNA extraction and library preparation. We prepared DNA extracts from two 
Neanderthal bones, SD1253 from El Sidrén Cave and Vi33.15 from Vindija Cave, 
as described in Rohland et al.*° (Supplementary Table 1), and prepared DNA 
sequencing libraries containing a special 4 base pair (bp) clean-room tag sequence 
to avoid contamination in later steps*!**. During library preparation, we used 
a uracil-DNA-glycosylase (UDG) and endonuclease VIII mix to remove uracils 
resulting from cytosine deaminations®. 

Chromosome 21 capture experiment. We used a strategy previously described’® 
that uses oligonucleotides synthesized on arrays to construct amplified probe 
libraries. We produced a probe library with a tile density of 3 nucleotides across 
the 29.8 Mb of non-repetitive sequences in chromosome 21 (GRCh37/hg19), with 
biotinylated probes similar to those described by Gnirke et al.*4. We used this 
probe library, as previously described”, to generate libraries from El Sidrén and 
Vindija Neanderthals. All libraries were subjected to a second round of ampli- 
fication, followed by two rounds of hybridization capture. Capture eluates were 
amplified, barcoded with two indexes’, pooled, and sequenced on the Genome 
Analyzer IIx (Illumina). 

Contamination estimates. Estimates of present-day human mtDNA contamina- 
tion in El Sidrén and Vindija libraries were previously reported in Castellano et al.”°. 
These contamination estimates were calculated using diagnostic positions at which 
archaic mitochondrial genomes differed from sequences in a panel of 311 pres- 
ent-day human mitochondrial genomes. Nuclear DNA contamination estimates 
were calculated using a previously described maximum likelihood approach’ that 
co-estimates the contamination and sequence error in the autosomes. 
Computational correction of cytosine deaminations. Sequences may carry 
residual cytosine deaminations in the first positions of the 5’ end and in the last 
positions of the 3’ end in spite of the UDG treatment*? (Supplementary Fig. 1). 
These bases are read as thymine and adenosine, respectively. As similarly described 
for the Altai Neanderthal genome’, we decreased the quality to 2 of any “T’ base 
occurring within the first five bases or any ‘A base within the last five positions in 
El Sidron and Vindija sequences. 

Variation discovery. We called Neanderthal genotypes with GATK* and applied 
a previously described set of filters”? (Supplementary Information section 3) 
to obtain high-quality sites for subsequent analyses. We obtained such calls for 
17,014,623 and 20,582,399 sites for El Sidrén and Vindija chromosome 21, respec- 
tively. Genotypes in the Altai Neanderthal, Denisovan and present-day human 
genomes were similarly obtained (Supplementary Table 6), and a combined file 
for all individuals was created and annotated as in Meyer et al.’. Because multiple 
contaminated DNA fragments are needed for a contaminated genotype to be called, 
the proportion of contaminated genotypes is likely to be smaller than the reported 
contamination of 1% among DNA fragments. 

Capture bias. In order to understand capture bias, we captured the chromosome 
21 of the Altai Neanderthal to an average coverage of 46.9-fold. We then down- 
sampled these sequences to assess capture bias at a wider range of average coverage 
from 8.1-fold to 35.7-fold, and did the same for the Altai Neanderthal shotgun 
sequences. The mean reference allele frequency is shifted from 0.52 in the shotgun 
sequences to 0.54-0.55, similar to the observed frequencies in the other archaic 
captured individuals (Supplementary Fig. 4). The mutation spectra after filtering 
do not change with coverage (Supplementary Fig. 2), and differences in allele fre- 
quency at heterozygous sites in the shotgun sequences are small (Supplementary 
Fig. 5). We observed that 3.8-5.2% of heterozygous sites in the shotgun sequences 
of chromosome 21 in the Altai Neanderthal are homozygous in the capture exper- 
iment at coverage from 14-fold to 46.9-fold (Supplementary Table 7). However, 
22.3-45.4% of these heterozygous sites are filtered out, mainly due to low coverage 
in the capture sequences. The same is true for sites that are heterozygous in the 
capture experiment at 46.9-fold coverage, but homozygous (4.9-5.9%) or missing 
(14.1-21.6%) in the shotgun data at 15.1-53.7-fold coverage. In addition, the dis- 
tribution of homozygosity stretches does not differ between the capture and the 
shotgun sequences (Supplementary Fig. 9). We conclude that capture bias does not 
distort our results in a systematic way. 

Sequence patterns. Our analysis of the divergence of the archaic genomes to 
Africans and to each other sought to uncover the patterns that distinguish modern 
human gene flow into the Altai Neanderthal lineage from archaic gene flow into 
the Denisovan lineage. To do this, we analysed 15,881 sequence windows of 100 kb 
in length across the genomes of the two archaic individuals. These windows 
were required to have high-quality genotypes (as described in Supplementary 
Information section 3) in at least 50% of its length in both archaic genomes. 
Because the phase of the archaic alleles is unknown, the divergence of the archaic 
genomes to Africans was calculated using the archaic alleles in each window that 


give the minimum number of differences to derived alleles at >0.9 frequency in 
504 individuals from five African present-day populations (Yoruba, Mende, Luhya, 
Gambian, and Esan)*. Using the minimum divergence to Africans allows intro- 
gressed segments from modern humans to be more easily identified. In contrast, 
the divergence between the archaic genomes was calculated using the archaic alleles 
in each window that give the maximum number of differences. Using the maxi- 
mum divergence between the archaic windows allows introgressed segments in 
either of the two archaic individuals to be more easily identified. Derived alleles 
were determined using the inferred ancestral base in the EPO six-primate align- 
ments”© and the minimum and maximum number of differences in a sequence 
window was divided by its number of high-quality genotypes. Regions of 
the genome described as inbred in the Altai Neanderthal? were excluded from this 
analysis. These are 103 regions >2.5 cM depleted in heterozygous sites. In this 
way, heterozygosity in Fig. 1b could be calculated from the same 15,881 sequence 
windows of 100 kb in Fig. 1a. 

We used the program ms?’ to simulate 15,881 sequence windows of 100 kb 
in length, using parameters that are consistent with the G-PhoCS estimates 
(Supplementary Information section 8). We simulated scenarios with and with- 
out modern human gene flow into the Altai Neanderthal lineage (Supplementary 
Information section 9). The mutation rate of 0.5 x 10°” mutations per bp and 
year*** and an average generation time of 29 years* (as assumed in the G-PhoCS 
inferences) were also used. The number of chromosomes simulated were 1,008 for 
the Africans, two for the Neanderthal, two for the Denisovan, one for the unknown 
archaic, and one for the chimpanzee. 

Alignments at neutral loci. Multiple sequence alignments were obtained for our 
main demography inference using G-PhoCS. Following the guidelines established 
in previous studies'’*°, we extracted multiple sequence alignments of the Altai 
Neanderthal, the Denisovan and multiple present-day humans at 13,753 loci, 1 kb 
long, selected to minimize influence of direct selection, linkage between loci, and 
missing data. Among these, 2,960 loci were selected from chromosome 21, for 
which sequence data was available from El Sidrén and Vindija Neanderthals. 
Demography inference. Our demography inference is based on five main 
G-PhoCS runs, each one containing the Altai Neanderthal, the Denisovan, the 
chimpanzee outgroup (panTro2), and two present day humans from a particular 
population. We considered populations from Africa (Yoruba and San), Europe 
(French), East Asia (China), and Oceania (Papuan). In five additional runs we 
added sequences from chromosome 21 of El Sidron and Vindija Neanderthals. To 
account for the fact that different individuals lived at different times, we modified 
the algorithm to sample the times of the archaic individuals as four additional free 
parameters (Supplementary Information section 8). To validate the robustness of 
our estimates, we conducted additional inferences using subsets of the archaic indi- 
viduals, different subsets of the loci, and allowing for gene flow from an unsampled 
(unknown) divergent human group, and explicitly modelling the source population 
of modern introgression into the ancestors of the Altai Neanderthal as an unsam- 
pled population branching off from the modern human population. 

G-PhoCS setup. In each G-PhoCS run, we ran the Markov chain Monte Carlo 
(MCMC) sampler for 100,000 burn-in iterations and 200,000 subsequent sam- 
pling iterations, and checked manually for convergence of the Markov Chain. The 
samples were used to estimate a posterior mean and 95% Bayesian credible interval 
for each demographic parameter. For parameters common to the five runs with 
different present-day humans, we combined the five parameter traces to obtain 
aggregated estimates. Estimates of population divergence time and effective pop- 
ulation size were calibrated by assuming an average mutation rate of 0.5 x 10° 
per base pair per year*’* and an average generation time of 29 years*’. Estimates 
under different assumptions on mutation rate and generation time are obtained 
by simple scaling of the reported estimates. Gene flow is measured using the total 
migration rate, which is the estimated per-generation rate times the number of 
generations that migration is allowed in the model. 

Simulations. To validate the G-PhoCS inferences we simulated, using ms?’, 
10,000 loci of 1 kb of length for the Altai Neanderthal, Denisovan, three 
present-day humans from San, Yoruba, and European populations and the chim- 
panzee outgroup. Demographic parameters were set according to the ones inferred 
on the genomic sequences, with parameters describing divergence times of modern 
populations and growth of the European population taken from recent studies!!! 
For these individuals, sequences were simulated under different scenarios for mod- 
ern human introgression into the Altai Neanderthal population: (1) no introgres- 
sion; (2) introgression from a population that diverged from present-day humans 
before the San divergence; (3) introgression from a population that diverged from 
the population ancestral only to Yoruba and Europeans; (4) introgression from 
a population that diverged from the population ancestral only to the San; and 
(5) introgression from a population ancestral only to Europeans. These five scenar- 
ios also included simulated gene flow from the Altai Neanderthal and an unsam- 
pled archaic population into the Denisovan population. G-PhoCS was run under 
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each scenario three times (one for each present-day individual) with the same 
settings used in the analysis of the actual genomes. 

ARGweaver analysis. ARGweaver was run using the Altai Neanderthal and 
Denisovan genomes, six modern human genomes (two Yoruba, two San, and 
two Mbuti; Supplementary Table 2), and the chimpanzee reference genome 
(panTro4). Filters were applied to mask regions with uncertain genotype calls. The 
genome was divided into roughly 5 Mb blocks with 1 Mb overlap between adjacent 
blocks. A new method to integrate over genotype phase was used on the archaic 
and present-day human genomes (Supplementary Information section 10). Other 
settings, such as the recombination and mutation rate map, and the population 
size (n= 11,534), were the same as previously reported'*. ARGweaver was run for 
5,000 MCMC iterations, with an ancestral recombination graph sampled every 
20 iterations starting at iteration 2,000. ARGweaver was run similarly with El 
Sidr6n and Vindija chromosome 21 included. ‘Africa’ and ‘deep ancestral’ haplo- 
types were determined in each sampled ancestral recombination graph using only 
a single lineage from each archaic genome to avoid differences in power between 
them due to different levels of heterozygosity and inbreeding. 

Screen for introgressed segments. A screen for modern human introgressed 
segments was performed using the frequency in Africans of derived alleles in sites 
that are heterozygous in one archaic genome (Altai Neanderthal or Denisovan) 
and homozygous ancestral in the other archaic genome. This allows us to identify 
segments that carry an archaic haplotype on three chromosomes, and a human 
haplotype only on one chromosome. Derived alleles were determined using the 
inferred ancestral base in the EPO six-primate alignments*°. Genotypes and allele 
frequencies for the African individuals were obtained from the 1000 Genomes 
project®. We fitted the African derived allele frequencies along each of the archaic 
genomes using a locally weighted polynomial regression (loess function in R), and 
selected those genomic segments containing at least 10 sites where the fitted curve 
to the derived African allele frequencies consistently stayed over a frequency of 
0.25 across 25 kb. Segments containing incompatible sites, that is, sites that 
were derived and shared in both archaic individuals, were removed. In the Altai 
Neanderthal, the average heterozygosity of the putatively introgressed segments 
is 4.9-fold higher than in random genome regions (Supplementary Information 
section 9). 

Homozygosity segments. Homozygous segments were defined as maximal regions 
between two heterozygous positions of length between 10 and 100kb or larger than 
100 kb. To compare the hominin samples with great apes, we masked regions for 
which no data on great apes were available” in addition to the filters described in 
Supplementary Information section 3. 

Prediction of functional consequences. We tested the functional consequences 
of the derived alleles using conservation scores from PhastCons”’. We calculated 
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the fractions of mutations in deleterious sites for the different human groups 
(Supplementary Information section 7). We used annotations of tran- 
scripts from ENSEMBL* to define coding regions, untranslated regions, and 
5,000 bases upstream of transcription start sites and downstream of transcrip- 
tion end sites. We used those as well as conserved transcription factor binding 
sites“ and conserved elements, and sampled randomly for each category the same 
number of bases in neutral sites to calculate the ratio of “functional” to “neutral” 
polymorphism. 


30. Rohland, N. & Hofreiter, M. Comparison and optimization of ancient DNA 
extraction. Biotechniques 42, 343-352 (2007). 

31. Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly 
multiplexed target capture and sequencing. Cold Spring Harb. Protocols 2010, 
http://dx.doi.org/10.1101/pdb.prot5448 (2010). 

32. Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in 
multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 
(2012). 

33. Briggs, A. W. et al. Removal of deaminated cytosines and detection of in vivo 
methylation in ancient DNA. Nucleic Acids Res. 38, e87 (2010). 

34. Gnirke, A. et al. Solution hybrid selection with ultra-long oligonucleotides for 
massively parallel targeted sequencing. Nature Biotechnol. 27, 182-189 
(2009). 

35. McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for 
analyzing next-generation DNA sequencing data. Genome Res. 20, 1297-1303 
(2010). 

36. Paten, B., Herrero, J., Beal, K., Fitzgerald, S. & Birney, E. Enredo and Pecan: 
genome-wide mammalian consistency-based multiple alignment with 
paralogs. Genome Res. 18, 1814-1828 (2008). 

37. Hudson, R. R. Generating samples under a Wright-Fisher neutral model of 
genetic variation. Bioinformatics 18, 337-338 (2002). 

38. Roach, J. C. et al. Analysis of genetic inheritance in a family quartet by 
whole-genome sequencing. Science 328, 636-639 (2010). 

39. Fenner, J. N. Cross-cultural estimation of the human generation interval for use 
in genetics-based population divergence studies. Am. J. Phys. Anthropol. 128, 
415-423 (2005). 

40. Freedman, A. H. et a/. Genome sequencing highlights the dynamic early history 

of dogs. PLoS Genet. 10, 1004016 (2014). 

Al. Gravel, S. et al. Demographic history and rare allele sharing among human 

populations. Proc. Nat! Acad. Sci. USA 108, 11983-11988 (2011). 

42. Prado-Martinez, J. et a/. Great ape genetic diversity and population history. 

Nature 499, 471-475 (2013). 

43. Durinck, S. et al. BioMart and Bioconductor: a powerful link between biological 

databases and microarray data analysis. Bioinformatics 21, 3439-3440 

(2005). 

44. Arbiza, L. et al. Genome-wide inference of natural selection on human 
transcription factor binding sites. Nature Genet. 45, 723-729 
(2013). 


© 2016 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Target population 


Chi 


Fre 


Pap 


Source population 
Yor San Pap Chi Fre 


Den 


Alt 


San Yor 


0 09 


18 26 35 44 53 61 7 


total migration rate (%) 


Extended Data Figure 1 | Migration rates in preliminary demographic 
inference. Total migration rates estimated for 22 directional migration 
bands in five separate preliminary G-PhoCS runs. Rows correspond 

to source populations and columns to the target populations. The 

20 migration bands between modern and archaic populations were 
considered in five separate runs, each containing the four bands associated 
with a different modern human population (Supplementary Fig. 15A). The 
two migration bands between the Denisovan and the Altai Neanderthal 
populations were considered in all five runs, and the values shown here 
correspond to an aggregate of all five runs. The estimates are as shown 

in Supplementary Fig. 15B. Shade indicates the posterior mean total 


migration rate (legend), which approximates the probability that a lineage 
in the target population originated in the source population. The 95% 
Bayesian credible intervals from 2,000 MCMC replicates are indicated 
for migration bands whose upper credible interval bound is above 
0.3%. We identified four clusters of migration bands, corresponding to 
what were likely at least four different cases of introgression between 
populations: (1) Neanderthals into non-African modern humans 

(red box), (2) Denisovans into Oceanians (green box), (3) between 
Neanderthals and Denisovans (magenta), and (4) modern humans into 
Neanderthals (blue box). Alt, Altai Neanderthal; Chi, Chinese; Den, 
Denisovan; Fre, French; Pap, Papuan; Yor, Yoruba. 
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Extended Data Figure 2 | Demographic inference on simulated data. 
Simulated data were generated under the demographic model as inferred 
by G-PhoCS (Supplementary Table 13). Each simulated data set consisted 
of 10,000 loci of 1 kb length. We simulated the Altai Neanderthal, the 
Denisovan, and three modern human populations corresponding to the 
San, Yoruba, and French, with modern human demography consistent 
with recent studies (Supplementary information section 8). Three 
migration bands were simulated: (1) from the Altai Neanderthal to the 
Denisovan, (2) from a population that diverged from the ancestors of all 
present-day humans 300,000 years ago into the Altai Neanderthal, and 
(3) from a population that diverged from the ancestors of all modern 

and archaic humans roughly 2.6 million years ago into the Denisovan. 

a, Estimates of effective population sizes (theta, 0), population divergence 
times (tau, T) and migration rates (m) from three G-PhoCS runs on 
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data simulated with gene flow from modern humans into the Altai 
Neanderthal lineage. Each run analyses an individual from a different 
present-day population, using the exact same setup used in our main 
analysis (Supplementary Fig. 15A). Parameters are typically estimated 
accurately, with 95% Bayesian credible intervals containing the values 
used in simulations (horizontal red lines). Rates of archaic gene flow into 
Denisovan appear to be somewhat overestimated, and differences between 
analyses of African and non-African populations are consistent with those 
observed in the data analysis (Supplementary Fig. 15B). b, Similar analysis 
done on data simulated without gene flow from modern humans into the 
Altai Neanderthal lineage. Accurate estimates are obtained for all model 
parameters, and no gene flow is inferred from modern humans into the 
ancestors of the Altai Neanderthal. Error bars represent the 95% Bayesian 
credible intervals from 2,000 MCMC replicates. 
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Extended Data Figure 3 | Simulation of different source populations using different present-day samples: European (French), Yoruba, or San. 
for modern introgression into Neanderthals. Estimated rates of Significant differences in estimates are observed between the data sets 
migration from the modern human population to the Altai Neanderthal with and without gene flow and the data set with gene flow from a source 
population obtained from 15 G-PhoCS runs on five simulated data sets. population related to Europeans. Only minor differences were observed 
All demographic parameters are set according to the values inferred by between values inferred for the three data sets with source population 
G-PhoCS in our data analysis (Supplementary Table 13), and the five data diverging from African populations (San, Yoruba, and ancestral). 
sets differ by the source population for migration: no migration (none), We conclude from this that the source population likely diverged from 
population ancestral to all present-day humans (ancestral), population an African human population before the divergence of present-day 
ancestral to Yoruba and Europeans (Yoruba), San population (San), Eurasians. The shaded circle in Fig. 3a represents this conclusion. 
and European population (European). The first two sets are the ones Error bars represent the 95% Bayesian credible intervals from 2,000 
analysed in Extended Data Fig. 2. Each data set is analysed three times, MCMC replicates. 
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Extended Data Figure 4 | Simulation of present-day human 
contamination. Simulated windows of 100 kb for the Altai Neanderthal 
and Denisovan genomes with present-day human contamination of 5% at 
the genotype level. Windows are binned by their minimum divergence to 
Africans using derived alleles at >0.9 frequency in the simulated African 
population. The x and y axis as in Fig. 1. a, Gene flow from a deeply 
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divergent archaic hominin into the Denisovan lineage (1%) and Altai 
Neanderthal gene flow into the Denisovan lineage (0.65%). b, Gene flow 
from a deeply divergent archaic hominin into the Denisovan lineage, Altai 
Neanderthal gene flow into the Denisovan lineage and modern human 
gene flow into the Altai Neanderthal lineage (1.8%). Error bars represent 
the 95% confidence intervals from 1,000 bootstrap replicates. 
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simulated data. a, Distribution of ‘African haplotype ages in sequences introgressed based on a threshold of < 234,000 years. The counts in 
simulated with introgression into the Altai Neanderthal lineage from each quadrant are for Altai Neanderthal (red) and Denisovan (blue), 
modern humans 100,000 years ago. ‘Africa’ haplotypes are identified as respectively. The counts for the Denisovan in the lower two quadrants 
in Fig. 2. Error bars represent the 95% Bayesian credible intervals from are zero because there was no simulated migration from modern humans 
302 MCMC replicates. b, Distribution of true haplotype ages for each of into the Denisovan lineage. Note that this is a somewhat nonstandard 
the estimated ages. The horizontal dotted lines show the estimated age. plot of true age versus estimated age; a more standard, reversed view is 
The plot is divided into four quadrants; the lower half represents ‘Africa’ given in Supplementary Fig. 33 and demonstrates that the estimated 
haplotypes having true ages between 100,000 and 620,000 years ago (the ages are largely unbiased. Error bars as in the standard Tukey box plot 
divergence time between archaic and present-day humans), which are (R boxplot function). 


necessarily due to post-divergence gene flow from modern humans. 
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Extended Data Figure 6 | Main G-PhoCS demographic inference. 
Summary of the main demographic inference using G-PhoCS in a model 
with four archaic populations and one modern human population. a, The 
population phylogeny assumed in each of the G-PhoCS runs. Labels on 
internal edges indicate names of the four ancestral populations: population 
ancestral to the two Western Neanderthals (W.NEA), population ancestral 
to all three Neanderthals (NEA), population ancestral to all four archaic 
individuals (ARC), and population ancestral to all human samples (HUM). 
We augmented the phylogeny with 14 directional migration bands 
(arrows) between all pairs of sampled populations except for the pairs of 
Neanderthal populations. In one of the runs we added an unknown ‘ghost’ 
population and a migration band from that population into the Denisovan 
population. b, Parameter estimates obtained by G-PhoCS in six separate 
runs analysing 13,754 neutral and loosely linked loci, substituting samples 
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in the ‘Modern’ population with pairs of present-day humans from five 
different modern populations (Supplementary Table 11). The last run has 
gene flow from the ‘ghost’ population and uses two Yoruba individuals 

in the modern human population. Bar heights indicate posterior mean 
and error bars correspond to 95% Bayesian credible intervals. Estimates 
of divergence times (T) and effective population sizes (0) are given in raw 
form, scaled by number of mutations per 10 kb (left axis), and calibrated 
to absolute units, 1,000 years for time, and 1,000 individuals for effective 
population size, (right axis) assuming an average mutation rate of 

0.5 x 10-° mutations per year per bp and an average generation time of 
29 years. For each of the 14 migration bands, we are showing the estimated 
total migration rates (m). See Supplementary Information section 8 for 
more information on parameter calibration and setup for G-PhoCS. 

A graphical summary of these estimates is given in Fig. 3. 
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bands in five separate G-PhoCS runs. Rows correspond to source MCMC replicates are indicated for migration bands whose upper credible 
populations and columns to the target populations. The 40 migration interval bound is above 0.3%. We identified four clusters of migration 
bands between modern (present-day) and archaic populations were bands, corresponding to what were likely at least four different cases of 
considered in five separate runs, each containing the eight bands introgression between populations: (1) Western (European) Neanderthals 
associated with a different modern human population (Extended Data into non-African modern humans (red box), (2) Denisovans into East 
Fig. 6a). The six migration bands between the Denisovan population and Asian and Oceanians (green box), (3) Neanderthals into Denisovans 
the three Neanderthal populations were considered in all six runs, and (magenta), and (4) modern humans into Eastern Neanderthals (blue box). 
the values shown here were estimated as an aggregate of all five runs. Directed arrows in Fig. 3a depict these introgression events. Sid, El Sidrén 
The estimates are as shown in Extended Data Fig. 6. Shade indicates Neanderthal; Vin, Vindija Neanderthal. 


the posterior mean total migration rate (legend), which approximates 
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Extended Data Figure 9 | Natural selection in chromosome 21. Ratio 
of functional (putatively deleterious) to neutral polymorphism in archaic 
and present-day humans (Supplementary Information section 7). TFBS, 
transcription factor binding sites; upstream refers to 5 kb before the 
transcription start site of genes; UTRs, untranslated regions (in the 
mRNA). PhastCons > 0.9 for a site to be used. Ne, Neanderthals; 

Af, Africans; As, Asians; Am, Americans. 
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Extended Data Table 1 | Shared derived alleles 


African frequency Osfs0.1 0.1<f<0.2 0.2<f<0.3 0.3<f504 O04<f<505 O0.5<f<06 O6<f<0.7 O.7<f<0.8 0.8<f<s0.9 0.9<f<s 1] Fixed 
Altai (%) 66.86 5.01 3.89 3.42 3.11 2.79 2.71 2.55 2.68 6.99 2.98 


Denisovan (%) 72.27 4.35 3.43 2.96 2.60 2.5 2.26 2.12 2.23 5.30 1.90 


Percentage of derived alleles in one archaic genome (Altai Neanderthal or Denisovan) shared with African genomes in sites that are homozygous ancestral in the other archaic genome. 
The percentages of shared derived alleles are binned by their African allele frequency. Fixed derived alleles in Africans are included in the '0.9 <f< 1’ but also shown separately in the fixed’ category. 
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Expanding antigen-specific regulatory 
networks to treat autoimmunity 


Xavier Clemente-Casares!, Jesus Blanco?**, Poornima Ambalavanan!*, Jun Yamanouchi!*, Santiswarup Singha!*, Cesar Fandos?, 
Sue Tsai!, Jinguo Wang!, Nahir Garabatos*, Cristina Izquierdo*, Smriti Agrawal®, Michael B. Keough®, V. Wee Yong®, Eddie James°, 


Anna Moore’, Yang Yang!®, Thomas Stratmann‘, Pau Serra* & Pere Santamaria 


1,2 


Regulatory T cells hold promise as targets for therapeutic intervention in autoimmunity, but approaches capable of 
expanding antigen-specific regulatory T cells in vivo are currently not available. Here we show that systemic delivery of 
nanoparticles coated with autoimmune-disease-relevant peptides bound to major histocompatibility complex class II 
(pMHCII) molecules triggers the generation and expansion of antigen-specific regulatory CD4* T cell type 1 (Trl)-like 
cells in different mouse models, including mice humanized with lymphocytes from patients, leading to resolution of 
established autoimmune phenomena. Ten pMHCII-based nanomedicines show similar biological effects, regardless of 
genetic background, prevalence of the cognate T-cell population or MHC restriction. These nanomedicines promote the 
differentiation of disease-primed autoreactive T cells into Tgl-like cells, which in turn suppress autoantigen- loaded 
antigen-presenting cells and drive the differentiation of cognate B cells into disease-suppressing regulatory B cells, 
without compromising systemic immunity. pMHCII-based nanomedicines thus represent a new class of drugs, potentially 
useful for treating a broad spectrum of autoimmune conditions in a disease-specific manner. 


Autoimmune diseases such as type 1 diabetes (T1D), multiple sclerosis 
and rheumatoid arthritis result from chronic autoimmune responses 
involving T cells and B cells recognizing numerous antigenic epitopes 
on incompletely defined lists of autoantigens’. Eliminating or sup- 
pressing all polyclonal autoreactive T-cell specificities (known and 
unknown) in each individual autoimmune disorder without compro- 
mising systemic immunity is not currently possible. 

Adoptive transfer of polyclonal FOXP3*CD4*CD25* regulatory 
T (Tyeg) cells expanded ex vivo has been proposed as an alternative ther- 
apeutic approach’. The potential for bystander immunosuppression, 
the lack of effective strategies for expanding antigen-specific Treg cells 
in vitro, and the lineage instability of FOXP3* Treg cells, have hindered 
the clinical translation of this approach>-’. Tpl FOXP3~CD4*CD25~ 
T cells, which produce the cytokines IL-10 and IL-21, and express 
the surface markers CD49b and LAG-3 and the transcription factor 
c-Maf®, constitute another regulatory T-cell subset recently exploited 
for the treatment of human inflammatory diseases?!!. However, as 
with FOXP3* Treg cells, there are no pharmacological approaches that 
can expand autoantigen- or disease-specific Tg1-like cells in vivo. 

We recently discovered that systemic delivery of nanoparticles (NPs) 
coated with T1D-relevant pMHC class I complexes could blunt the 
progression of T1D by expanding subsets of CD8* T cells with reg- 
ulatory potential but conventional memory-like phenotype”. As this 
outcome could be replicated with different pMHC class I complexes, 
we reasoned that pMHC-NP therapy may utilize a naturally occur- 
ring negative feedback regulatory loop, whereby chronic autoantigenic 
exposure (and exposure to pPMHC-NPs) would trigger the differen- 
tiation of autoreactive T cells into regulatory T-cell progeny. By this 
reasoning, we predicted that NPs coated with disease-relevant pMHCII 


complexes might be able to expand disease-specific regulatory CD4* 
T cells in vivo. 


Expansion of disease-specific Tl cells 

We treated non-obese diabetic (NOD) and NOD Foxp3-eGFP mice 
(expressing enhanced green fluorescent protein (eGFP) under the con- 
trol of the mouse Foxp3 promoter) with uncoated nanoparticles or nan- 
oparticles coated with a pMHC, 2.5mi/IA®’ (ref. 13), recognized by the 
diabetogenic BDC2.5-specific T-cell receptor (TCR), or with 2.5mi/IA®” 
monomers. Nanoparticles coated in 2.5mi/IA®’ induced expansion of 
cognate CD4* T cells in blood and spleens of all mice (Fig. 1a, b). These 
cells had a memory-like (CD44"'1CD62L») FOXP3~ Trl-like pheno- 
type, expressing ICOS, latent-associated TGF-B and the Ta] markers 
CD49b and LAG-3 (Fig. 1c, d). A similar outcome was observed in mice 
treated with 2.5mi/IA®’—NPs upon depletion of CD4+CD25* T cells 
(Extended Data Fig. 1a). Unlike their tetramer~ counterparts, these 
cells proliferated and secreted IL-10 and to a lesser extent IFNy, but not 
IL-2, IL-4 or IL-17, in response to dendritic cells (DCs) pulsed with the 
2.5mi peptide (Extended Data Fig. 1b). Real-time reverse-transcription 
(RT)-PCR analyses confirmed the Tg1-like phenotype of these cells 
(Supplementary Table 1). 

To determine if pMHCII-NPs could directly trigger Tp1 marker 
and IL-10 expression on cognate CD4* T cells, we cultured naive and 
anti-CD3 plus anti-CD28 monoclonal antibody (mAb)-preactivated 
2.5mi/IA®’-tetramer*CD4* T cells from BDC2.5-TCR-transgenic 
NOD Foxp3-eGFP or NOD 1110“ mice (carrying an eGFP insertion 
in the 1/10 locus)" in the presence of 2.5mi/IA8’—NPs, 2.5mi peptide or 
2.5mi/IA®’” monomer. Naive T cells expressed neither CD49b nor LAG-3, 
even after incubation with 2.5mi/IA8’—NPs, 2.5mi/IA8’ monomer 
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Figure 1 | pMHC-NPs relevant for T1D or EAE expand cognate 
disease-suppressing Tp1-like CD4* T cells in vivo. a, b, Tetramer- 
staining profiles (a) and percentages of tetramerCD4* T cells (b). 

Data correspond to pre-diabetic NOD females treated for 5 weeks 
(blood: n=5, 8 and 6; spleen: n=5, 18 and 6, respectively). Tet, tetramer. 
c, Tetramer-staining of splenic CD4* T cells from treated or untreated 
NOD Foxp3-eGFP mice. d, The tetramer*CD4¢ T cells of 2.5mi/IA®’- 
NP-treated mice display a Ta1-like phenotype. e, Incidence of diabetes 
in T-cell-reconstituted NOD scid hosts transfused with CD4* T cells 
from different donors + 2.5mi/IA%’—NPs (n= 11, 5, 7 and 6 from top). 

f, Percentages of tetramer*CD4* T cells in 2.5mi/IA®’—NP-treated or 
untreated NOD scid hosts (n = 4-5 per group). g, Incidence of disease 
reversal in diabetic mice treated with pMHC-NPs (n =9, 7, 7, 7 from top 


or 2.5mi peptide (Extended Data Fig. 1c, d). However, preactivated 
T cells upregulated both markers as well as eGFP (IL-10) only in 
response to 2.5mi/IA®’-NPs (Extended Data Fig. 1d, e). In agree- 
ment with this, expression of IL-10 in NOD 1110” mice treated with 
2.5mi/IA8’-NPs was largely restricted to the CD49b*LAG-3*CD4* 
subset (Extended Data Fig. 1f). 

In vitro, the tetramer’ CD4* T cells of pMHC-NP-treated mice 
suppressed the proliferation of non-cognate (islet-specific glucose- 
6-phosphatase catalytic subunit-related protein (IGRP)- or LCMV 
Gp33-specific) CD8* T cells in response to peptide-pulsed DCs, in 
an IL-10- and TGF-$-dependent manner (Extended Data Fig. 1g). 
In vivo, splenic CD4* T cells from donors treated with pMHC-NPs 
suppressed diabetes development in T-cell-reconstituted NOD scid (also 
known as NOD Prkdc*“'*) hosts (Fig. le), an effect that was potentiated 
by treating hosts with pMHC-NPs (Figs le, f). 

We next investigated whether 2.5mi/IA2’-NPs or NPs coated with 
IGRP4_22/TA®” or IGRP 428-1 45/IA®’, targeted by sub-dominant pools 
of autoreactive CD4* T cells'®, could restore normoglycaemia in 
diabetic NOD mice. Unlike mice treated with nanoparticles coated 
with hen egg-white lysozyme (HEL) 4-29/IA®’, 90-100% of the mice 
that received nanoparticles coated with 2.5mi/IA®’, IGRP4_39/IA®’ or 
IGRP j0g-145/IA®” reverted to stable normoglycaemia (Fig. 1g, Extended 


left to right), or IGRP4_) peptide! and IGRP4_y) peptide-NP (n = 9). 

h, Percentage of tetramer*CD4* T cells in diabetic mice at onset, 

in response to 2.5mi/IA®’—NP therapy and age-matched non-diabetic 
controls (n= 8, 6, 2 and 7 from left). i, Insulitis scores (n = 6, 4, 3 and 

6 from left). Bottom, representative images. j-m, C57BL/6 mice were 
immunized with pMOG35;_5s. j, EAE scores of mice treated from day 14 
(n=4 each). k, EAE scores of mice treated from day 21 (n= 10, 7 and 3 
from top). 1, Percentage of tetramer*CD4* T cells in spleen and blood 
of mice from j and k (n= 13, 14and 5 from top). m, Representative flow 
profiles of CD4* T cells from mice in j and k. n, Representative microglial 
IBA1 stainings and relative rank scores in the cerebellum of mice from k 
(n=4-5). P values were calculated via Mann-Whitney U-test, log-rank 
(Mantel-Cox) test or two-way ANOVA. Error bars, s.e.m. 


Data Fig. 1h) and displayed systemic expansion of cognate T1-like 
T cells (Fig. 1h, Extended Data Fig. 2a-g). Treatment with peptide'® or 
peptide-coated nanoparticles but without MHC could not reproduce 
any of these effects (Fig. 1g, Extended Data Figs 1h and 2h). Treatment 
withdrawal resulted in loss of the normoglycaemic state in 25-60% 
of mice (Extended Data Fig. 1i), in association with the loss of the 
tetramer*CD4* T-cell pools (Fig. 1h, Extended Data Fig. 2a). The 
animals that maintained normoglycaemia had normal postprandial 
serum insulin levels, fasting glucose tolerance (Extended Data Fig. 1j-m) 
and reduced insulitis (Fig. li). In addition, their pancreatic lymph 
nodes (PLNs) could not support the proliferation of carboxyfluores- 
cein succinimidyl ester (CFSE)-labelled IGRP296_214/ Kt specific CD8* 
T cells in vivo (Extended Data Fig. 1n). 

We next tested the ability of nanoparticles coated with myelin oli- 
godendrocyte glycoprotein (pMOG)s3s_49/IA? to blunt the progression 
of pMOG3s_55-induced experimental autoimmune encephalomyelitis 
(EAE, a model of multiple sclerosis) in C57BL/6 mice. PMOG38_49/ 
IA>_NP therapy dampended disease progression when given on day 
14 after immunization and restored motor function in paralytic mice 
when given on day 21 (Fig. 1), k). These effects were mirrored by weight 
gain, and were associated with systemic expansion of cognate Tp1-like 
T cells, reductions in activated macrophage/microglia in the cerebellum, 
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fewer inflammatory foci and areas of demyelination in the white matter 
of the cerebellum and decreased demyelination of the spinal cord 
(Fig. 1l-n, Extended Data Figs 3a-f). Similar therapeutic effects 
were seen in HLA-DR4-IE-transgenic C57BL/6 IAb™" mice (MHCII 
knockout mice expressing a transgenic hybrid MHCII molecule com- 
posed of the peptide-binding domain of human HLA-DR4 and the 
membrane-proximal domain of mouse IE (DR4-IE)) immunized with 
human (h) proteolipid protein (hPLP)175_192 or hMOGo7_i0g peptides 
and treated with hPLP}75_192/DR4-IE or hMOGo7_109g/DR4-IE-NPs 
upon developing limb paralysis (Extended Data Figs 4a-d). 


Disease and organ specificity 

Studies in another autoimmune disease model, collagen-induced 
arthritis (CIA), showed that nanoparticles displaying mouse collagen 
(mCII)259-273/DR4-IE could reduce joint inflammation in arthritic 
HLA-DR4-IE-transgenic C57BL/10.M mice in association with 
systemic expansions of cognate T1-like T cells (Figs 2a—-e, Extended 


a 
Ak MCll p55, 579/DR4-NPs 


Data Fig. 4e). In contrast, nanoparticles coated with hMOGo7-10/ 
DR4-IE complexes had no effect (Fig. 2a-c) 

To investigate further the disease-specificity of pMHC-NP ther- 
apy, we induced EAE in C57BL/6 [Ab™" HLA-DR4-IE-transgenic 
mice by immunization with hPLP}75_192 and treated diseased mice with 
hPLP75_192/DR4-IE-NPs (positive control), uncoated nanoparticles 
(negative control), EAE-relevant hMOGo7_193/DR4-IE-NPs (which 
display a peptide from a CNS autoantigen other than that used to 
induce disease), or CIA-relevant mCII259-273/DR4-IE—-NPs. Whereas 
hMOGoy7-103/DR4-IE-NPs blunted EAE as efficiently as the positive 
control, mCIH59-273/DR4-IE-NPs had no therapeutic activity (Fig. 2f, 
Extended Data Fig. 4f, g). Here, too, therapeutic activity was associ- 
ated with systemic expansions of cognate Tp1-like T cells (Fig. 2g, h). 
Administration of mCIIy59_273 peptide!® or of mCII359_273-peptide- 
coated microparticles (MPs) to arthritic C57BL/10.M HLA-DR4-IE- 
transgenic mice (Fig. 2a—d, Extended Data Fig. 4e), or of hMOGo7_ 108 
peptide, hMOGo7-10s/DR4-IE monomers or hMOGo7-103-coated 
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Figure 2 | Therapeutic effects are disease-specific and dependent 

on both pMHC and nanoparticles. a—f, C57BL/10.M HLA-DR4-IE- 
transgenic mice immunized with bovine collagen. a, Left, changes in joint 
swelling (top) and clinical scores (bottom) in response to uncoated NPs, 
pMHC-NPs, peptide s.c.'° or peptide-coated MPs i.v!’. Treatment was 
initiated when joint swelling reached 130% of baseline (data normalized 
to the initiation of treatment (100% value)) (n = 4, 4, 4 and 8 from top). 
Right, percentage increase in joint swelling relative to pre-immunization 
baseline (100% value). b, Representative haematoxylin and eosin (first 
row) and O-safranin/fast-green/haematoxylin (second and third rows) 
knee joint staining images. Third row shows enlarged images of lacunae 
on the bone and meniscal articular surfaces. 1, panus formation; 2, cellular 
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Treatment 


175-192 


infiltration of the meniscus; 3, bone erosion; 4, proteoglycan depletion; 

5, loss of chondrocyte/lacunnae. c, Average pathology scores (n = 3-4 

per group). d, Percentage of tetramer*CD4* T cells. e, Representative 
flow cytometry profiles for Tal markers in mCIIy59_273/DR4-NP-treated. 
f-h, C57BL/6 [Ab™" HLA-DR4-IE-transgenic mice immunized with hPLP. 
f, Changes in EAE scores ((n=5, 4, 13 (4-9 per group), 5, 19 (4-5 per group, 
see also Extended Data Fig. 4h), 4 and 5 from top). g, Percentage of 
tetramer’ CD4* T cells in the spleen of mice from f (n=4, 5, 4, 6, 15, 3 and 
3 from left). h, Representative flow cytometry profiles for Tp1 markers. 
Data were compared using Mann-Whitney U-test or two-way ANOVA. 
Error bars, s.e.m. 
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nanoparticles or microparticles to C57BL/6 IAb™! HLA-DR4-IE- 
transgenic mice failed to both expand cognate T1-like cells and blunt 
disease (Fig. 2f, g, Extended Data Fig. 4f-i). Thus, the biological and 
therapeutic effects of pMHCII-NPs are disease-specific and dissoci- 
ated from the pathogenic role of epitopes (disease-triggering versus 
downstream autoantigenic targets), suggesting that these compounds 
act on pre-activated autoreactive T cells and can generate Tr1-like cell 
expansions from rare T-cell precursor pools. 


Soluble mediators 

Blockade of IL-10, TGF-8 and IL-21R (but not IFN‘) abrogated the 
anti-diabetogenic properties of 2.5mi/IA®’—NPs or IGRP4_9/IA8’— 
NPs (Fig. 3a, Extended Data Fig. 5a). With the exception of IL-21R 
blockade (known to inhibit CD8* T-cell activation), these interven- 
tions also abrogated the suppression of autoantigen crosspresentation 
by the pMHC-NP-expanded Tpl-like T cells in the PLNs (Extended 
Data Figs 5b, c, 6). Studies using diabetic NOD Ifng’/~ and NOD 
I110~'~ mice revealed that development of the Tl precursors and/ 
or Tpl-like cells that expand in response to therapy requires IFNy in 
addition to IL-10 (Extended Data Figs 5d-f, 6). mAbs against IL-10, 
TGF-( and IL-21R also abrogated the anti-encephalitogenic activity of 
hPLP}75_192/DR4-IE-NPs in C57BL/6 [Ab™! HLA-DR4-IE-transgenic 
mice (Extended Data Fig. 5g-j). pMOG35_s5-immunized C57BL/6 
1127r~/~ mice responded to pMOG3g_ 49/IA>—NPs like their wild-type 
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counterparts (Fig. 1j-n, Extended Data Fig. 5k—n). Thus, IFNy and 
IL-10, but not IL-27'%, are necessary for pMHC-NP-induced Txl-like 
cell development; and autoreactive Ta1-like T cells use IL-10, TGF-8 
and IL-21 (but not IFNy) to suppress disease. 


Downstream effectors and network formation 
The PLNs (but not the mesenteric lymph nodes (MLNs) or spleens) 
of pMHC-NP-treated NOD mice harboured increased percentages of 
B cells compared with the PLNs of mice treated with pMHCII-NPs not 
relevant for T1D (Extended Data Fig. 50). Studies of mice treated with 
a range of pMHC-NP doses revealed that the sizes of the PLN (but not 
splenic) B-cell and Tp1-like cell pools were correlated (Extended Data 
Fig. 5p). Unlike their splenic or MLN counterparts, the PLN B cells 
of these mice could not effectively present peptide to cognate CD4* 
T cells ex vivo (Extended Data Fig. 5q). In addition, these cells produced 
IL-10 in response to lipopolysaccharide (LPS) (Extended Data Fig. 5r), 
suggesting that pMHC-NP-induced Tzl-like cells might trigger the 
formation and expansion of regulatory B (Byeg) cells in the PLNs. In fact, 
2.5mi-pulsed B cells, but not DCs, underwent expansion in 2.5mi/TA8’”— 
NP-treated hosts within a week of transfer (Extended Data Fig. 5s, t). 
To probe this further, we transfused NOD 110°” splenic B cells that 
were either pulsed with 2.5mi or a negative control peptide (GPI2g2-292); 
into 2.5mi/IA8’-NP-treated NOD or NOD I110~/~ hosts. Seven days 
later, the hosts were analysed for the presence of IL-10-producing 
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Figure 3 | Disease reversal involves effects of Tpl cytokines on cognate 
B cells and local CD11b* cells, without compromising systemic 
immunity. a, Blood glucose levels in diabetic NOD mice treated with 
2.5mi/IA8’—-NPs and blocking antibodies (n = 8, 4, 6, 6, 5 and 4 from 

top to right). b, Expression of IL-10 (eGFP) and upregulation of CD5 

and CD1d by eGFP~ 2.5mi-pulsed splenic B cells from NOD I/10°!? 
donors in 2.5mi/IA8’-NP-treated NOD hosts. c, Averaged results from b 
(n=4, 3, 3 and 7 from left). d, Incidence of diabetes in T-cell-reconstituted 
NOD scid hosts left alone or transfused with PLN CD19* cells (n=7, 13 
and 7 from top). e, Incidence of diabetes in T-cell-reconstituted NOD scid 
hosts transfused with CD19* and/or CD4* cells (n=7, 6, 3, 7, 8, 1l and 13 


treated untreated 


from top). f, Cytokine and chemokine profiles of PLN and MLN CD11b* 
cells from 2.5mi/IA®’-NP-treated NOD mice in response to LPS (n = 3-4 
each). g, Percentage of tetramer*CD4* T cells in the spleens (left), and 
viral titres in the ovaries (right) of treated compared with untreated NOD 
mice 4 and 14 days after vaccinia virus infection (n =3 per group). 

h, Percentages of tetramer*CD4" T cells in the spleens (left) and serum 
anti-dinitrophenyl (DNP) antibody titres (right) in treated and untreated 
NOD mice immunized with keyhole limpet haemocyanin (KLH)-DNP 
(n= 3-5 per group). Data were compared using Mann-Whitney U-test, 
log-rank test or two-way ANOVA. Error bars, s.e.m. 
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(eGFP*) CD5*CD1d"#" B cells. NOD (but not NOD I/10~'~) mice 
treated with 2.5mi/IA8’—NPs efficiently induced formation of Breg cells 
specifically from 2.5mi-pulsed B cells, and IL-21R but not IL-10 or TGF-8 
blockade suppressed this effect (Fig. 3b, c, Extended Data Fig. 5u). 

In vitro, the PLN B cells of 2.5mi/IA8’—-NP-treated mice had a 
moderate suppressive effect on the proliferative activity of BDC2.5 
CD4* T cells in response to peptide-pulsed DCs (Extended Data 
Fig. 5v). In vivo, these B cells suppressed diabetes development in 
T-cell-reconstituted NOD scid hosts as compared to PLN B cells from 
control mice (Fig. 3d). Co-transfer of PLN B cells and bulk or 2.5mi/ 
IA®’-tetramer* splenic CD4* T cells from 2.5mi/IA®’—NP-treated mice 
resulted in >95% suppression, as compared to PLN B cells with or 
without tetramer-CD4* T cells from 2.5mi/IA2”-NP-treated mice, to 
CD4* T cells with or without MLN B cells from 2.5mi/IA®’—NP-treated 
mice (~40%), or to CD4* T cells from untreated or control-NP-treated 
mice (0%) (Fig. 3e), supporting synergistic effects. In agreement 
with this, treatment of newly diabetic NOD mice with a B-cell- 
depleting anti-CD20 mAb abrogated the anti-diabetogenic activity of 
2.5mi/IA®’-NPs (Fig. 3a, Extended Data Fig. 5x). 

Comparison of the cytokine and chemokine profile of CD11b* cells 
derived from the PLN or MLN of pMHC-NP-treated NOD mice fur- 
ther revealed that CD11b* cells from the PLN produced lower levels of 
the pro-inflammatory mediators IL-3, IL-17, IL-6, IFNy, CXCL9 and 
CXCL10 in response to LPS than their MLN counterparts did (Fig. 3f). 
Importantly, the effects of pMHC-NP therapy on antigen-presenting 
cells (APC)s from draining lymph nodes were not associated with 
impaired systemic immunity because pMHC-NP-treated NOD mice 


cleared an acute viral infection and mounted antibodies against an 
exogenous antigen as efficiently as untreated mice (Figs 3g, h). 


Antigen- experienced T cells as targets 

The memory-like phenotype and the upregulation of T-bet mRNA in 
the expanded Tx1-like cells, coupled with the inability of pMHC-NPs 
to expand cognate Tgl-like cells in non-diseased mice or NOD Ifng /~ 
mice suggested that pMHC-NPs expand pre-existing Tl cells that 
arise from antigen-experienced precursors; and/or trigger the differ- 
entiation of antigen-experienced Ty] cells into Tg1-like cells. Indeed, 
whereas diabetic NOD G6pc2-'~ mice (which lack IGRP) responded 
to 2.5mi/IA®’—NPs like wild-type NOD mice, they did not respond 
to IGRP4_22/IA®’-NPs (Figs 4a, b). In vitro, 2.5mi/IA®’—-NPs triggered 
the expression of CD49b and LAG-3 and the upregulation of c-maf, 
1121, 1110 and Lag3 mRNA exclusively in anti-CD3 plus anti-CD28 
mAb-activated, but not naive BDC2.5 CD4* T cells (Fig. 4c, Extended 
Data Fig. 1d). 

To investigate this further, we transfused naive (CD44'°’CD62L"%) 
or memory-like (CD44*'CD62L”) BDC2.5 CD4* T cells into hosts of 
the congenic NOD.Thy1* strain and measured changes in their expres- 
sion of LAG-3 and CD49b protein and c-maf, 1121, 1110, Ifng, Lag3 and 
Cd49b mRNA, both upon 2.5mi/IA8’—NP therapy and in the absence of 
therapy. Notably, the memory T cells from pMHC-NP- untreated hosts 
expressed about one hundred-fold higher levels of c-mafand [121 and, 
toa lesser extent, Lag3 and Cd49b, but not 110 mRNA than their naive 
counterparts (Fig. 4d). This is in accordance with the observed demeth- 
ylation of 1/21 and the c-Maf/IL-10- and IL-21-expression competency 
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Figure 4 | The Ta1-like CD4* T cells arising in response to pMHCII-NPs 
are derived from antigen-experienced precursors. a, Percentage of 
tetramer*CD4* T cells in hyperglycaemic NOD G6pc2~'~ compared with 
NOD mice treated with IGRP4_32/IA®’— (n = 4 and 7) or 2.5mi/IA®”—NPs 
(n=6 and 9). b, Blood glucose levels in hyperglycaemic NOD 

G6pc2"'" mice in response to pMHC-NP therapy (n= 4-6 per group). 

c, Upregulation of Tpl transcripts by anti-CD3/anti-CD28 mAb-activated 
eGFP CD4°* T cells from BDC2.5 NOD Foxp3-eGFP mice in response to 
different in vitro stimuli (n =4 mice each). d, Changes in Tg1-relevant 
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transcripts in naive or memory BDC2.5 CD4* T cells in response to 
2.5mi/IA8’-NPs in vivo (n=6, 6, 5 and 4 from left). e, LAG-3 and CD49b 
profiles (blue; compared with isotype control in red) of Thy1>* cells from d. 
f, Proliferation of CFSE-labelled memory BDC2.5 CD4* T cells in 

NOD. Thy!‘ hosts in response to 2.5mi/IA®’—NPs. g, Incidence of diabetes 
in T-cell-reconstituted NOD scid hosts transfused with naive or memory 
BDC2.5 CD4* T cells and treated with bi-weekly doses of 2.5mi/IA8”—NPs 
(n=4 and 3) or left untreated (n = 4 and 6). P values were calculated via 
Mann-Whitney U-test or log-rank (Mantel-Cox) tests. Error bars, s.e.m. 
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of effector/memory CD4* T cells'*”’, and suggests that the memory 
T-cell pool is enriched for uncommitted Tg] precursors, expressing a 
Trl-poised transcriptional program. Remarkably, whereas pMHC-NP 
therapy only upregulated Lag3 mRNA and, to a lesser extent, LAG-3 
protein in naive BDC2.5 CD4* T cells, it promoted the upregulation 
of 1110 mRNA and LAG-3 and CD49b protein, and the proliferation 
of memory BDC2.5 CD4* T cells (Fig. 4d-f). Similar results were 
observed using memory eGFP” (FOXP3~) BDC2.5 CD4° cells from 
BDC2.5 NOD Foxp3-eGFP mice (Extended Data Fig. 5y). 

These effects on antigen-experienced T cells were accompanied by 
acquisition of anti-diabetogenic properties: whereas pMHC-NP therapy 
afforded 100% diabetes protection to T-cell-reconstituted NOD scid hosts 
bearing memory BDC2.5 T cells, therapy was inconsequential in hosts 
receiving naive BDC2.5 T cells (Fig. 4g). Therefore, pMHC-NP ther- 
apy promotes the differentiation (and expansion) of c-Maf-expressing 
antigen-experienced CD4* T cells into Tal progeny. 


Translational potential 

We determined the ability of human T1D-relevant pMHCII-NPs to 
expand cognate Tkl-like T cells in NOD scid Tl2rg! ~ (NSG) hosts recon- 
stituted with peripheral blood mononuclear cells (PBMCs) from T1D 
patients (Supplementary Table 2). Initial assay development focused 
on NSG hosts reconstituted with PBMCs from five DRB1*0401+ 
recent-onset T1D patients and treated with nanoparticles coated in either 
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human glutamic acid decarboxylase-65 (GAD65)555-567(r5571)/ DR4 
or preproinsulin (PPI)76-90(xsgs)/DR4 (Fig. 5a, b, Supplementary Table 2). 
We then repeated these experiments using NSG hosts reconstituted 
with PBMCs from 7 DRB1*0301* T1D patients and a third T1D- 
relevant pMHC-NP type (hIGRP13-25/DR3-NPs) given at a higher 
dose. We saw expansion of tetramer*CD49btLAG-3*CD4" T cells in 
the spleen and/or PLNs (endogenous mouse (m)IGRP}3_25 is highly 
homologous to hIGRP}3_25) from all seven pMHC-NP-treated mice 
and none of the untreated controls (Fig. 5c, d, Supplementary Table 2, 
Extended Data Fig. 7a). The average percentage and numbers of 
tetramertCD4* T cells in IGRP13_25/DR3-NP-treated mice were signif- 
icantly greater than in untreated littermates (Fig. 5d) and expressed J110 
mRNA (Fig. 5e). These responses could not be induced with peptide 
or peptide-coated nanoparticles or microparticles (Fig. 5d, Extended 
Data Fig. 7b and Supplementary Table 2). 

The PLNs of the pMHC-NP-treated mice that harboured increased 
percentages of tetramer'CD4* T cells had increased cellularity 
(Fig. 5f). Furthermore, there were correlations between the number of 
PLN tetramer*CD4* T cells and the percentage and absolute number 
of PLN human B cells, and the PLN B cells, unlike their splenic coun- 
terparts, produced IL-10 in response to LPS (Fig. 5g-i), suggesting Breg 
formation and/or recruitment. No such responses were seen in patient 
hPBMC-reconstituted NSG mice treated with peptide or peptide-NPs/ 
MPs (Fig. 5f). 
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Figure 5 | Human T1D-relevant pMHC-NPs expand cognate Trl-like 
CD4* T cells in human PBMC-engrafted NSG hosts. a, Expansion 

of cognate CcD4* T cells by GAD%555-567(5571)/DR4—NPs (top) or 
PPI76-90s8s)/DR4-NPs (bottom) in NSG mice engrafted with PBMCs 
from DR4* T1D patients. b, CD49b and LAG-3 marker expression on the 
sample at the bottom of a. c, Expansion of cognate Tl1-like CD4* T cells 
in NSG mice engrafted with PBMCs from DR3* T1D patients in response 
to IGRP}3_25/DR3-NP-therapy. d, Percentages (left) and numbers (right) 
of tetramer*CD4* T cells in mice engrafted with T1D PBMCs in 
response to treatment (n for spleen and PLN per treatment = 9/6, 7/6 

and 14/1 from left legend). e, Expression of [110 mRNA in IGRP}3_25/DR3 


No. of IGRP,, 9./DR3 tet* cells (<10™) 


tetramer*CD4* T cells from mice treated with IGRP}3_5/DR3-—NPs 

(n=3 each). f, The PLNs of responder mice contained increased numbers 
of lymphocytes compared to the other groups (n =6, 3, 4, 3 from top 
legend). g, h, Correlation between the absolute numbers of IGRP}3_25/DR3 
tetramer” cells in the PLNs (g) or spleen (h) and the percentage or 
absolute number of PLN or splenic B cells in IGRP3-25/DR3-NP-treated 
mice (n= 6 and 7). i, Secretion of IL-10 by LPS-stimulated CD19* cells 
(ex vivo, for 24h) isolated from the PLNs or spleens of hPBMC-engrafted 
NSG mice treated with IGRP}3_25/DR3-NPs (n = 3 each). P values were 
calculated by Mann-Whitney U-test or Pearson correlation test. 

Error bars, s.e.m. 
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Discussion 
We have shown that systemic therapy with nanoparticles coated with 
autoimmune-disease-relevant pMHC class II complexes triggers the 
expansion of cognate Tr1-like CD4* T cells, restores normoglycaemia 
in spontaneously diabetic mice and motor function in paralyzed EAE 
mice, and resolves joint swelling and destruction in arthritic mice, 
without compromising systemic immunity. We demonstrate that this 
outcome is dissociated from genetic background and type of autoimmune 
disease and can be replicated with ten different human or murine 
autoimmune-disease-relevant pMHC-NP types. The cell surface 
phenotype, cytokine secretion pattern, transcriptional profile and function 
of the Tgl-like cell pools expanded by pMHCII-based nanomedicines 
are consistent with those described for murine Trl-like CD4* T cells 
and remarkably similar to Tal cells derived from healthy individuals 
and autoimmune disease patients®. We demonstrate key roles for prior 
autoantigenic experience and IFN~- and IL-10-expression competence 
in the developmental biology of autoreactive Tp1 cells. We show that 
pMHCII-NPs promote IL-10 transcription and the upregulation of Tal 
markers in Tl -poised, antigen-experienced CD4* T cells in an APC- 
and IL-27-independent manner, followed by systemic expansion. The 
need for IFN, the expression of the Ty1 transcription factor T-bet, the 
c-Maf/IL-10- and IL-21-expression competency of effector and memory 
CD4* T cells!®-?°, and the ability of pMHCII-NPs to turn T cells primed 
by active immunization into Tl suppressors suggest that these Tal 
precursors are effector/memory Ty1 cells. We define the mechanisms 
of action and uncover a cascade of cellular interactions downstream of 
the pMHC-NP-expanded Txl-like cells, including B,g cell formation, 
that coordinately lead to the resolution of inflammation in an antigen- 
dependent but antigen-non-specific manner (Extended Data Fig. 8). 
Collectively, our data support the contention that any single pMHC 
involved in a given autoimmune disease could be used to blunt com- 
plex autoimmune responses via this approach. Consistent with this 
prediction, the 20 pMHCI/II-based nanomedicines tested to date 
have similar efficacy, regardless of antigen prevalence, dominance or 
role in the disease process. Neither pMHC monomers nor peptides or 
peptide-coated nanoparticles/microparticles trigger cognate Trl cell 
formation/expansion from the polyclonal T-cell repertoires or reverse 
T1D, CIA or EAE in the chronic models tested here. pMHC-based 
nanomedicines thus represent a new class of therapeutics in autoim- 
munity, capable of resolving cellularly and antigenically complex auto- 
immune responses in a disease- and organ-specific manner without 
compromising systemic immunity. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Mice. NOD/Ltj, NOD scid, BDC2.5-NOD, NOD I110~'~, C57BL/6, C57BL/6 
1127r-'~, C57BL/10.M, NOD Foxp3-egfp and NOD scid Tlarg! ~(NSG) mice were 
purchased from the Jackson Lab. NOD Ifng-/~ and LCMV Gp33-specific TCR- 
transgenic NOD mice were from D. Serreze (Jackson Lab). HLA-DR4-IE-transgenic 
C57BL/6 [Ab""! mice were purchased from Taconic Farms. NOD 110°? (tiger) 
mice were obtained by backcrossing the 110°” allele from C57BL/6 110%” 
mice (Jackson Lab) onto the NOD/Ltj background for 10 generations. 8.3-NOD 
and NOD G6pc2"- mice have been described elsewhere”**, These studies were 
approved by the corresponding institutional animal care committees. No statistical 
methods were used to predetermine sample size. 

Antibodies, tetramer staining and flow cytometry. FITC, PE, PerCP or biotin- 
conjugated mAbs against mouse CD4 (RM4-5), CD8«a (53-6.7), B220 (RA36B2), 
CD62L (MEL-1), CD69 (H1.2F3), CD44 (IM7), and CD49b (DXS5) and streptavi- 
din-PerCP were purchased from BD Pharmingen. The antibody against 
murine LAG-3 (C9B7W) was from eBioscience. Anti-latent-associated-TGF-8 
antibody (TW7-16B4) was from BioLegend. PE-conjugated pMHC class II 
tetramers were prepared using biotinylated pMHC monomers. Peripheral 
blood mononuclear cells, splenocytes, lymph node and bone marrow CD4* 
T cells were incubated with avidin for 15 min at room temperature and stained with 
tetramer (51g ml!) in FACS buffer (0.05% sodium azide and 1% FBS in PBS) for 
30-120 min at 4°C or 37°C, depending on the tetramer, washed, and incubated 
with FITC-conjugated anti-CD4 (5\.g ml~') and PerCP-conjugated anti-B220 
(2pg ml~}; asa ‘dump’ channel) for 30 min at 4°C. Cells were washed, fixed in 1% 
paraformaldehyde (PFA) in PBS and analysed with FACScan, FACSaria, or BD 
LSRII flow cytometers. For other phenotypic analyses, single-cell suspensions were 
stained with pMHC tetramers and antibodies diluted 1:100 in FACS buffer (all used 
at 4°C except anti-LAG-3, which was used at 37 °C), washed, fixed in 1% PFA, and 
analysed by FACS. All phenotypic staining were performed in the presence of an 
anti-CD16/CD32 mAb (2.4G2; BD Pharmingen) to block Fc receptors. Analysis 
was done using FlowJo software. 

NSG-engrafted human T cells were analysed using the following mAbs: FITC- 
conjugated anti-CD4 (OKT4, BioLegend), APC-conjugated anti-CD19 (HIB19, BD 
Pharmingen), PerCP-conjugated polyclonal goat anti-LAG-3 IgG (R&D Systems), 
biotin-conjugated anti-CD49b (AK7, Pierce Antibodies, Thermo Scientific), and 
EF450-conjugated streptavidin (eBioscience). Briefly, splenocytes and pancreatic 
lymph node cells were incubated with avidin (0.25 mg ml”! in FACS buffer) for 
30 min at room temperature, washed and stained with tetramer (51g ml~!) for 1h 
at 37°C, washed, and incubated with FITC-conjugated anti-CD4 (2/100), APC- 
conjugated anti-CD19 (5/100; used as a ‘dump’ channel), PerCP-conjugated anti- 
LAG-3 (8/100) and biotin-conjugated anti-CD49b (4/100) at 4°C for 45 min. After 
washing, the cells were incubated with EF450-conjugated streptavidin for 30 min 
at 4°C, washed, fixed in 1% PFA in PBS and cells within the hCD4*/hCD19~ gate 
analysed with a FACSCanto II (BD Bioscience). 

Peptides and pMHCs. Unless specified otherwise, recombinant pMHC class II 
monomers were purified from culture supernatants of induced Drosophila SC2 cells 
transfected with constructs encoding I-A8 and I-Aq chains carrying c-Jun or c-Fos 
leucine zippers, respectively, and a BirA and 6x His tags. In these constructs, the 
peptide-coding sequence was tethered to the amino-terminal end of the I-A8 chain 
via a flexible Gly-Ser linker as described'?. GAD65555(5571)-567/DR4, PPI76-90(sgs)/ 
DR4 and IGRP}3_25/DR3 monomers were produced by loading the corresponding 
peptides onto DR4 and DR3 complexes purified from supernatants of induced SC2 
cells, as described”*. Other constructs (those encoding 2.5mi/ TA8”, pMOG35-55/ TA, 
hMOGo7-10s/DR4-IE, hPLP}75_192/DR4-IE and MCII59-273/DR4-IE) were purified 
from supernatants of Chinese Hamster Ovary (CHO) cells transduced with lenti- 
viruses encoding a monocistronic message in which the peptide-MHC and MHCa 
chains of the complex were separated by the ribosome skipping P2A sequence”®. 
These monomers were engineered to encode a BirA site, a 6 x His tag and a free 
Cys at the carboxyterminal end of the construct. The self-assembled pMHC class 
II complexes were purified by nickel chromatography and used for coating onto 
nanoparticles or processed for biotinylation and tetramer formation as described 
above. The epitopes encoded in the different monomeric constructs used here 
include: 2.5mi; AHHPIWARMDA)"; IGRP_ 145 (TAALSYTISRMEESSVTL) 
and IGRP4_7. (LHRSGVLITHHLQEDYRTY)»; HEL 4-22 (RHGLDNYRG); 
GAD65555(5571)-s67 (NFIRMVISNPAAT)”’; PPly6_90(g8s) (SLQPLALEGSLQSRG)*§; 
IGRP}3-25 (QHLQKDYRAYYTF)*; pMOG3¢_49 (GWYRSPFSRV VH); hMOGo7_ 108 
(TCFFRDHSYQEE); hPLP 175-192 (YIYFNTWTTCQSIAFPSK); and mCII959_273 
(GIAGFKGDQGPKGET). IGRP4_2, IGRP 28-145 and GPlig2-292 (LSIALHVGFDH) or 
2.5mi, PMOG35_55 (MEVGWYRSP FSRVVHLYRNGK), PMOG3_49, hMOGoz- 108 and 
hPLP} 75-192 peptides were purchased from Sigma Genosys, Mimotopes or Genscript. 
Nanoparticles, pMHC-NP, peptide-NP and peptide-MP synthesis and 
purification. We coated pMHCs onto crosslinked dextran-coated or pegylated 
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iron oxide NPs (CLIO- or PFM-NPs, respectively). Briefly, CLIO-NPs were 
treated with ammonia to produce amino groups (NH). Avidin was oxidized with 
sodium periodate and added to the amino-NPs. Further incubation with sodium 
cyanoborohydride was used to generate a stable covalent bond. Finally, biotiny- 
lated monomers were added to the nanoparticles at a molar ratio of 4 mol biotin/ 
mol avidin?’. PFM-NPs were produced by thermal decomposition of Fe(acac); 
in the presence of 2 kDa methoxypolyethylene glycol maleimide (S. Singha 
et al., unpublished data). The NPs were purified using magnetic (MACS) columns 
(Miltenyi Biotec) or an IMag cell separation system (BD BioSciences). To conjugate 
pMHC or free peptide to PFM-NPs, we incubated pMHCs or peptide carrying a 
free carboxyterminal Cys with nanoparticles in 40 mM phosphate buffer, pH 6.0, 
containing 2mM EDTA, 150mM NaCl] overnight at room temperature. The 
pMHC-conjugated nanoparticles were separated from free pMHC or peptide 
using magnetic columns, sterilized by filtration through 0.2m filters and stored 
in water or PBS at 4°C. Quality control was performed using transmission electron 
microscopy, dynamic light scattering, and native and denaturing gel electrophore- 
sis. PMHC or peptide content was measured using different approaches, including 
Bradford assay (Thermo Scientific), denaturing SDS-PAGE, amino acid analysis 
(HPLC-based quantification of 17 different amino acids in hydrolyzed pMHC-NP 
preparations) or dot-ELISA (Singha et al., unpublished data). 

Peptide-coated microparticles were made using carboxylated 500 nm diameter 
polystyrene beads from Polysciences (Warrington, PA) as previously described!” 
The peptides were conjugated to polystyrene beads via carbodiimide chemis- 
try following the manufacturer’s instructions. Briefly, we incubated 25011 PSB 
(containing ~9 x 10!! beads) with 250 1g peptide in 0.1 M MES buffer, pH 5.0 
at room temperature with gentle rolling in the presence of 1mg EDC for 2h. The 
peptide-conjugated polystyrene beads were washed with PBS to remove uncon- 
jugated peptides and analysed with native and denaturing PAGE against serial 
dilutions of unconjugated peptide and microparticle controls. 
pMHC-NP and peptide or peptide-NP therapy in NOD mice. Experiments in 
pre-diabetic NOD mice involved treating (i.v.) cohorts of 10-week-old female mice 
with 7.5 1g of pMHC-NPs, or equivalent amounts of soluble pMHC monomers 
or uncoated nanoparticles twice weekly for 5 consecutive weeks. Experiments in 
diabetic mice involved following cohorts of 10-week-old female NOD/Ltj, NOD 
G6pc2-'~, NOD I110~/~ or NOD Ifng~/~mice for diabetes development by meas- 
uring blood glucose levels with Accucheck Strips (Roche) twice a week. Mice dis- 
playing two consecutive measurements >11mM were considered diabetic and 
treated twice weekly with 7.5 1g pMHC-NPs, nanoparticles delivering a molecular 
equivalent of peptide or free peptide (8g per dose)'®, until stably normoglycae- 
mic (defined as 8 consecutive measurements <11 mM) or until hyperglycaemia 
was considered irreversible (3 measurements >25 mM). In Figs 1g and 4b and 
Extended Data Figure 1h), mice were randomized into treatment with 2.5mi/ 
TA87—-NPs or HEL} 4-22/IA®8’—NPs (Fig. 1g) or with 2.5mi/IA%’—NPs or IGRP4_22/ 
IA2’-NPs (Fig. 4b). In Fig. 1g, IGRP4_39/IA®” and IGRP}3_145/IA®’ were tested in 
separate cohorts of mice. Mice treated with peptide or peptide-NPs (Fig. 1g) were 
randomized into either treatment within the same experiment. In vivo cytokine 
neutralization experiments involved administering mAbs against CD20 (5D2, a 
gift from A. Chan, Genentech; three doses of 250 1g i.v. on days 0-2 relative to 
the onset of hyperglycaemia) or 500|1g of HRPN (rlgG1), IFNy (R4-6A2), IL-10 
(JES5-2A5), TGF-f (1D11) or IL-21R (4A9) (BioXcell) i-p. twice a week for 2 weeks, 
followed by 200 1g per dose for 3 additional weeks. Mice were randomized into 
cytokine-blocking mAb-treatment (IFN+, IL-10, TGF3) or HRPN rat-IgG1 groups. 
Anti-CD20 and anti-IL21R mAbs were tested in separate cohorts of diabetic mice 
(Fig. 3a). Animals were assessed daily for glycosuria (corresponding to >16mM 
blood glucose) and given human insulin isophane (1 IU per day) s.c. if positive. 
Upon treatment withdrawal, NOD mice were monitored for recurrence of hyper- 
glycaemia until 50 weeks of age. 

Peptide, pMHC, pMHC-NP, peptide-NP or peptide-MP therapy in EAE. Six- 
to eight-week-old female C57BL/6, C57BL/6 [127r~'~ or HLA-DR4-IE-transgenic 
C57BL/6 [Ab™!! mice were immunized with 150 peg of PMOG35_55 or hMOGo7-108 
or hPLP}7s_192, respectively in CFA s.c. at the base of the tail, under isofluorane 
anaesthesia. The mice received 300 ng of Pertussis toxin i.v. on days 0 and 3. Mice 
were weighed and scored daily starting on day 10 after immunization. The score 
system used was been reported elsewhere™ and plotted over a 5-point scale. When 
most of the mice showed signs of advanced disease (day 14) or reached maximum 
disease scores (day 21), mice were divided into different treatment groups, syn- 
chronized for weight and disease score averages, and treated twice a week with 
pMHC-coated and uncoated nanoparticles, an identical amount of pMHC mon- 
omer, peptide-coated nanoparticles (at an equivalent dose of peptide), free peptide 
(81g per dose iv. or s.c.)!®, peptide-conjugated microparticles (15g of peptide 
per dose)!” or unconjugated microparticles for 5 weeks. Mice were randomized 
into treatment with pMHC-NPs (one or two different types, depending on the 
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experiment, as described in Figs 1j-n and 2f-h and Extended Data Figures 3 and 4), 
uncoated nanoparticles or no treatment. Peptide, peptide-MPs, peptide-NPs, 
pMHC monomers and uncoated microparticles were tested together; mice were 
randomized into each treatment group as mice reached the indicated disease score 
(Fig. 2f and Extended Data Figures 4f-i). An additional control cohort was treated 
with a single dose of peptide-conjugated microparticles (Extended Data Fig. 4h). 
Anti-cytokine and cytokine receptor mAb blocking studies (Extended Data 
Fig. 5g) involved randomization of mice into each treatment group. 

Peptide, pMHC-NP or peptide-MP therapy in CIA. Bovine collagen II (bCII) 
dissolved in 0.05M acetic acid at 2 mg ml! was emulsified in CFA (v/v) containing 
4mg ml! of killed Mycobacterium tuberculosis (H37Ra). Eight- to twelve- week- 
old HLA-DR4-IE-transgenic C57BL/10.M mice were immunized intradermally 
at the base of the tail with 100 1g of bCII in CFA and boosted with 100\1.g of bCII 
in IFA on days 14 and 28. The size of all four paws was measured using a caliper 
before immunization (day 0) and daily upon disease onset. Disease progression was 
measured as percentage increase in joint swelling relative to day 0. When this value 
reached 130%, mice were divided into different treatment groups and treated with 
pMHC-NPs, Cys-coated (pMHC unconjugated) NPs (25 1g of pMHC for pMHC- 
NPs, or an equivalent amount of iron for Cys-conjugated NPs), free peptide (8 1g 
per dose s.c.)!° or peptide-conjugated MPs (151g of peptide per dose)!” iv. twice 
a week for 5 weeks. Mice were randomized into treatment with either pMHC-NP 
or uncoated nanoparticles, or into peptide or peptide-MP, respectively (Fig. 2a and 
associated Extended Data Data Figures). Mice were also assessed for clinical signs 
of disease up to a maximum clinical score of 12 as reported elsewhere*!. 

Peptide, pMHC-NP, peptide-NP and peptide-MP therapy in human PBMC- 
reconstituted NSG hosts. PBMCs from new or recently diagnosed HLA-DRB1* 
0401* or -DRB1*0301* TID patients (recruited with informed consent, approved 
by the Institutional Review Board at Hospital Clinic) were depleted of CD8* T cells 
using anti-CD8 mAb-coated magnetic beads (Miltenyi Biotech) and injected iv. 
(2 x 10”) into 8-10-week-old NSG hosts. Mice were treated with pMHC-NPs at 
the indicated doses, peptide-coated-NPs (at an equivalent dose of peptide), pep- 
tide alone (8 1g per dose s.c.)!° or peptide-conjugated microparticles (15 1g of 
peptide per dose)!” starting on day 5 after PBMC transfusion, twice a week for 5 
consecutive weeks, or left untreated. Individual patient samples were processed 
separately and injected into two (for pMHC-NP and peptide-NP experiments) or 
three separate mice (for peptide and peptide-MP experiments); one or two of the 
two-to-three hosts used in each of these experiments were treated and the other 
was left untreated (Supplementary Table 2). Therapy-induced expansion of cognate 
CD4° T cells was measured in PLNs and/or spleen as described above. The HLA 
genotype, gender, age, months from diagnosis and type of pMHC-NP tested for 
each patient are summarized in Supplementary Table 2. 

Intraperitoneal glucose tolerance tests. Animals were fasted overnight and chal- 
lenged with 2 mg kg“! of p-glucose i-p. Blood glucose was monitored from the tail 
vein with a glucometer at different time points before and after glucose challenge. 
Serum insulin content was measured using the Mouse Ultrasensitive Insulin ELISA 
(ALPCO). 

Evaluation of systemic cellular and humoral immunity. For the evaluation of 
cellular responses, pMHC-NP-treated and untreated female mice were injected 
with 2 x 10° plaque-forming units (pfu) of recombinant Vaccinia Virus (rVV) 
iv. Cohorts of mice were killed on day 4 and 14 after infection and processed for 
pMHC tetramer staining and rVV titre measurements. Briefly, the ovaries were 
weighed, homogenized using a pestle in 300 ul of RPMI-1640 containing 10% FBS, 
freezed-thawed 3 times followed by 3 rounds of sonication (20 seconds each). Serial 
dilutions of the lysates were added to confluent BSC-1 cell cultures in 6-well plates, 
incubated at 37°C for 2h, washed twice with PBS and cultured in DMEM10. On 
day 2, the supernatants were discarded and the cell layers were stained with crystal 
violet to reveal plaques. 

To evaluate humoral immunity, pMHC-NP-treated and untreated mice were 
immunized i.p. with 100 \1g of DNP-KLH (Alpha Diagnostic International) in 
CFA. An identical boost was performed 3 weeks later. Mice were killed 10 days 
later. Anti-DNP antibody titres were measured by diluting serum samples in PBS 
containing 0.05% Tween 20. Anti-DNP antibodies were semi-quantified using an 
anti-DNP Ig ELISA Kit (Alphadiagnostic International) following the manufac- 
turer’s instructions. 

Proliferation and cytokine secretion assays. CD4* T cells from pMHC- 
NP-treated mice were enriched from peripheral lymphoid organs using a BD 
Imag enrichment kit, stained with pMHC tetramers as described above and sorted 
by flow cytometry. For assays using memory and naive BDC2.5 CD4* T cells, 
cells were enriched using Stem Cell Technologies enrichment kit, stained with 
antibodies and sorted. FACS-sorted cells (2-3 x 10*) were co-cultured with bone 
marrow-derived DCs (2 x 10*) pulsed with 21g ml"! of peptide. Supernatants 
were collected 48h later for measurement of cytokines via Luminex and the cells 


were pulsed with 1 microcurie (j1Ci) of @H)-thymidine and collected after 24h to 
measure thymidine incorporation in triplicates. 

To ascertain whether pMHC-NP therapy promoted the generation of IL-10- 
secreting B-cells in the PLNs of PBMC-engrafted NSG hosts, we stained the PLN 
and splenic cell suspensions of individual mice with anti-hCD4-FITC, anti- 
hCD19-APC and tetramer-PE as described above, and sorted B-cells by flow 
cytometry (FACSAria-BD Biosciences). The B cells sorted from each organ were 
stimulated with LPS (1jgml~, Sigma) for 24h in RPMI-1640 supplemented with 
10% human AB serum. The IL-10 content in the supernatants was measured in 
duplicates via Meso Scale technology using a V-PLEX Custom Human Cytokine 
kit for hIL-10 (Meso Scale Discovery). Data were normalized to the splenic B-cell 
values and reported as fold-change. 

Isolation and in vitro stimulation of CD11b* cells from the PLNs and MLNs. 
CD11b* cells from LNs were obtained by digestion in collagenase D (1.25 1g ml“) 
and DNase I (0.1 ,gml~') for 15 min at 37°C followed by purification with CD11b 
(BD Imag) mAb-coated magnetic beads. Cells were stimulated for 3 days with LPS 
(2;gml~') and the supernatants analysed for cytokine content with a Luminex 
multiplex cytokine assay. 

In vitro suppression assays. FACS-sorted 2.5mi/IA®’ tetramer positive or negative 
cells (2 x 10) were co-cultured with bone marrow-derived DCs (2 x 10°) pulsed 
with 21g ml"! ‘suppressor’ (2.5mi or GPlyg2_292) and ‘responder’ (gp33 or NRP-V7) 
peptides. Responder cells were CD8* T cells (2 x 10*) purified from from 8.3-NOD 
or LCMV-Gp33-specific TCR-transgenic NOD mice using BD-Imag beads. These 
cells were labelled with CFSE (51M) and added to the DC cultures in duplicates 
or triplicates. Dilution of CFSE in the responder cells was measured 48h later by 
FACS. In other experiments, the wells were supplemented within 24h of co-culture 
with HRPN rlgG, anti-IFNy, anti-IL10 or anti-TGF-( (all 10p.g ml~!) or the IDO 
inhibitor, 1-methyl tryptophan (1-MT; 400 ,.M). 

In vivo suppression of crosspresentation. For crosspresentation assays in 
non-transgenic mice, we transfused CFSE-labelled 8.3-CD8* reporter cells 
(5-10 x 10°) into untreated or pMHC-NP-treated mice and measured CFSE 
dilution in the hosts’ lymphoid organs within 7 days after transfer. 

Adoptive transfer of suppression. Splenic CD4* or CD8* T cells (107) from 
untreated mice or mice treated with 10 doses of 2.5mi/IA®’—NPs or uncoated 
nanoparticles were transfused into 5-10 week-old NOD scid females. The hosts 
were transfused 24h later with 2 x 107 CD4* or CD8* T-cell splenocyte mixtures 
purified from female NOD donors. The hosts were monitored for development of 
diabetes for at least 90 days after transfer (Fig. le). In another experiment, the hosts 
were treated twice a week with 2.5mi/IA2’—NPs (Fig. le). In other experiments 
(Fig. 3e), CD4* or CD8* T-cell-reconstituted 5-6-week-old NOD scid females 
were transfused with 5 x 10° CD19* cells purified from the PLNs of mice treated 
with 10 doses of uncoated or 2.5mi/IA8’-coated NPs during the preceding 5 week 
(Fig. 3d). B-cells were purified using the EasySep Mouse CD19-positive selec- 
tion Kit II (StemCell Technologies). Other cohorts, studied separately (Fig. 3e), 
received PLN or MLN CD19* cells (5 x 10°) plus total splenic CD4* T cells (107) or 
2.5mi/IA®’ tetramer* (2 x 10°) or tetramer~ CD4* T cells (107) from 2.5mi/IA®”— 
NP-treated donors. The hosts were randomized into each transfusion group and 
monitored for development of diabetes together. Figure 3e includes data from the 
corresponding cohors studied in Figs le and 3d. Isolation of 2.5mi/IA®’ tetramer* 
and tetramer™ cells from total splenic CD4* T cells of 2.5mi/IA8’—NP-treated mice 
was performed using anti-PE mAb-coated microbeads and MACS LD columns 
(Miltenyi Biotec). 

B-cell proliferation and B,.g induction in vivo and B,eg suppression in vitro. To 
isolate splenic DCs, spleens were digested in collagenase D and DNase for 15 min 
at 37°C and DCs purified using anti-CD11c mAb-coated magnetic beads (MACS). 
The cells were pulsed with 10g ml! of 2.5mi or GPIyg2-292 peptide for 2h at 37°C 
and labelled with CFSE (0.5|1M) or PKH26 (21M), respectively. Labelled cells 
(5-10 x 10°; mixed at 1:1 ratio) were administered i.v. into pMHC-NP-treated or 
untreated NOD mice. Three days later, we compared the ratios of CFSE* versus 
PKH26" cells in the spleens of the different hosts by FACS. Similar experiments 
were done using peptide-pulsed splenic B cells isolated from female donor mice 
using anti-B220 mAb-coated magnetic beads (MACS). 

For in vivo Byeg induction assays, B cells from NOD 110%"? (tiger) mice were 
enriched using a CD19 enrichment kit (Stem Cell Technologies) and pulsed with 
2.5mi or GPIpg2-292 peptides (10pgml') for 2h at 37°C. The peptide-pulsed B 
cells were washed twice with PBS, labelled with PKH26 and transfused (1 x 10°) 
into pMHC-NP-treated or untreated mice. The hosts were killed 7 days later and 
their spleens labelled with anti-B220-APC and biotinylated anti-CD1d or anti-CD5 
mAbs and Streptavidin-PerCP. PKH26? cells were analysed for presence of 
eGFP*CD1d"* or CD5* cells by flow cytometry. 

To determine the role of Ta1-derived cytokines in B,eg formation (Extended 
Data Fig. 5u), we repeated the experiments described above but using 3 x 10° 
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B cells and hosts treated with 250 or 500 1g (given i-p. daily from day —3 to day 6 
relative to B-cell transfer) of anti- HRPN (rlgG1), anti-IL-10 (JES5-2A5), anti- TGF3 
(1D11) or anti-IL-21R (4A9) mAbs (BioXcell). Hosts were randomized into each 
antibody-treatment group and studied together. 

To measure the ability of the Tpl-induced Byeg cells to suppress the anti- 
gen-induced activation of T cells in vitro, we isolated CD19* B cells from the 
PLNs of age-matched untreated NOD mice or NOD mice treated with 10 doses 
of 2.5mi/IA8’—NPs and cultured these cells with LPS (10,.g ml!) overnight. 
We then cultured these cells (2 x 10) with 2.5mi-peptide-pulsed (0.1 ,1gml~!) bone 
marrow-derived DCs (2 x 10*) and CFSE-labelled BDC2.5 CD4* cells (4 x 10%). 
Dilution of CFSE in CD4* cells was measured 3 days later. 

CD25*CD4* T,eg depletion. NOD mice were treated with 500j1g of anti-CD25 
(PC61.5.3, BioXcell) i.p. 3 times weekly from 8 weeks of age, followed by 
10-injections of pMHC-NPs given twice weekly starting at 10 weeks of age. 
Average CD4*CD25* and FOXP3-eGFP*CD4° T-cell depletion was 90% and 
70%, respectively. 

Histology. Tissues were fixed in 10% formalin and embedded in paraffin. H&E- 
stained pancreata were scored for insulitis as reported”*. Briefly, insulitis was scored 
as: 0, none; 1, peri-insulitis; 2, infiltration covering <25% of the islet; 3, covering 
25-50% of the islet; and 4, covering >50% of the islet. 

Spinal cord and brain tissues were fixed in 10% buffered formalin for a mini- 
mum of 24h, embedded in paraffin and sectioned at 61m. Slides from paraffin- 
embedded tissues were deparaffinized and subjected to antigen retrieval by 
steaming the slides in 10 mM sodium citrate buffer (pH 6.0) for 20 min and cooling 
at room temperature for 20 min. For immunohistochemistry, slides were fixed with 
10% formalin and treated with 3% HO, in methanol at —20°C. Sections were 
permeabilized with 0.25% Triton-X 100 and blocked with a skim milk blocking 
solution. Rabbit anti-IBA1 (Wako, 1:500) or rat anti- MBP (Abcam) were incubated 
at 4°C overnight followed by respective biotinylated secondary antibodies (1:500), 
avidin-biotin complex, and 3,3’-diaminobenzidine. Sections were counterstained 
with haematoxylin and eosin, dehydrated with graded ethanol and mounted with 
Acrytol. For histological myelin staining, slides were fixed with 10% formalin or 
deparaffinized, dehydrated with graded ethanol, and incubated with 0.2% luxol 
fast blue in 95% ethanol at 65°C. Slides were developed in 0.05% lithium carbonate, 
counterstained with haematoxylin and eosin, and mounted with Acrytol. Images of 
cerebellum were taken on an Olympus bright-field microscope. Inflammatory foci 
(dense nuclear clusters or perivascular cuffs with corresponding demyelination) 
were counted and their size measured using Image] software. For quantification 
of relative IBA1 intensity, blinded observers ranked images from highest to lowest 
intensity. 

Knee joints from bCII-immunized mice were fixed in 4% buffered formalin 
overnight, and decalcified with 14% EDTA over 3 weeks. Decalcified paws were 
embedded in paraffin, sectioned at 8 jm and stained with haematoxylin and eosin 
to score infiltration and pannus formation on a scale of 5, where 5 corresponds 
to erosive arthritis, with severe infiltration and pannus covering 60% of the joint 
space. Proteoglycan depletion at the articular surface of the tibia and femur was 
assessed by the loss of safranin-O stain intensity. For this, sections were depa- 
raffinized, hydrated and stained with haematoxylin before staining with 0.05% 
aqueous fast green for 5 min. Slides were fixed with 1% acetic acid and stained 
with 0.1% aqueous safranin-O for 2 min, dehydrated with graded ethanol, cleared 
with xylene and mounted with DPX. Scoring was done on a scale of 0 to 3 cor- 
responding to: 0, 0% depletion, 1, low (<25%), 2, moderate (25-50%), and 3, 
severe (>50%). Destruction of articular cartilage included an assessment of the 
presence of dead chondrocytes (empty lacunae) and was scored on a scale of 3 
(0, no empty lacunae; 3, complete loss of chondrocytes on articular cartilage/ 
severe cartilage erosion). 

Isolation of CNS-infiltrating lymphocytes. Mice were anesthetized with 
Ketamine-Xylazine and perfused with PBS through the heart left ventricle. The 
brain and spinal cord were isolated manually, cut into small fragments and digested 
with a solution of collagenase D (1.25,.gml-!) and DNase I (1% w/v) in HBSS for 
30min at 37°C. The digested CNS was passed through a 701m cell strainer. Cells 
were resuspended in DMEM (supplemented with 2% FBS and 10 mM HEPES) 
and 100% Percoll (to a final Percoll concentration of 30%). The solution was lay- 
ered onto 65% Percoll and centrifuged at 380g for 30 min at room temperature. 
The mononuclear cell layer lying at the interphase was washed with RPMI before 
further analyses. 

Quantitative RT-PCR. RNA was extracted from 2.5mi/IA®’ tetramer* or 
tetramer CD4* T cells sorted from 2.5mi/IA®’-NP-treated NOD mice and 
stimulated in vitro with anti-CD3/anti-CD28 mAb-coated dynabeads. 

Each tetramer* sample corresponded to cells pooled from 2-3 mice. RNA 
was reverse transcribed and cDNA plated in Mouse Immunology 384 StellArray 
qPCR plates (Bar Harbour BioTechnology) with 2X SYBR Green Master Mix 
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(Applied Biosystems). The plate was run in a 7900HT Applied Biosystems real- 
time PCR instrument, and the raw data was analysed using the Global Pattern 
Recognition (GPR) analysis tool (http://www.gene-quantification.com/qpcr-array. 
html). mRNA isolated from additional samples was subjected to RT-qPCR using 
primers specific for IL-21 (Forward: 5’‘-TCATCATTGACCTCGTGGCCC-3'; 
Reverse: 5’-ATCGTACTTCTCCACTTGCAATCC-3’), IL-10 (Forward: 5’-CT 
TGCACTACCAAAGCCACA-3’; Reverse: 5/-GTTATTGTCTTCCCGGCTGT-3’), 
c-Maf (Forward: 5‘-AGCAGTTGGTGACCATGTCG-3’; Reverse: 5/-TGGA 
GATCTCCTGCTTGAGG-3’), IFN-7 (Forward: 5‘’-TGAACGCTACACACTGCA 
TCTTGG-3’; Reverse: 5‘-CGACTCCTTTTCCGCTTCCTGAG-3’), LAG-3 
(Forward: 5’‘-TCCCAAATCCTTCGGGTTAC-3’; Reverse: 5’/-GAGCTAGACTC 
TGCGGCGTA-3’), CD49b (Forward: 5’-CCGGGTGCTACAAAAGTCAT-3’; 
Reverse: 5‘-GTCGGCCACATTGAAAAAGT-3’), Aryl Hydrocarbon Receptor 
(Forward: 5’-CGTCCCTGCATCCCACTACTT-3’; Reverse: 5’-GGACATGG 
CCCCAGCATAG-3’) and ICOS (Forward: 5’-TGACCCACCTCCTTTTCAAG-3'; 
Reverse: 5’-TTAGGGTCATGCACACTGGA-3’). 

pMHC-NP- induced upregulation of Trl transcripts in in vitro-activated CD4T 
T cells was performed by culturing mouse naive eGFP” BDC2.5-CD4* T cells from 
BDC2.5 NOD Foxp3-eGFP mice (CD62L""FOXP3~eGFP’; 1.5 x 10° ml!) with 
anti-CD3/anti-CD28 mAb-coated microparticles (1 bead per cell) for three days in 
the absence of APCs, followed by a one day culture of re-purified (microparticle- 
free) CD4* T cells in rhIL-2 (30 IU ml’), and a 6-day culture with 2.5mi peptide 
(10 pg ml“), 2.5mi/IA®” monomers (25 jug pMHC per ml), 2.5mi/IA®”—NPs (25 1g 
pMHC per ml and 50jg ml? iron), or unconjugated nanoparticles (50 ig iron 
per ml). Relative gene expression was calculated using unstimulated cultures as 
controls. 

pMHC-NP- induced upregulation of Ta] transcripts in naive compared to mem- 
ory BDC2.5 CD4* T cells in vivo was done by transfusing naive (CD44™°¢CD62L") 
or memory (CD44"'CD62L»”) eGEP-CD4* T cells from BDC2.5-TCR-transgenic 
NOD or NOD Foxp3-eGFP mice (Thy1>*) (1-1.5 x 10° cells per host) into 
NOD. Thy!“ hosts and by treating the hosts with four doses of 2.5mi/IA%’-NPs 
over two weeks or leaving them untreated. Two and a half weeks later, Thyl>+cD4* 
T cells were sorted from the hosts and challenged with anti-CD3 and anti-CD28- 
coated magnetic Dynabeads for 3 days before mRNA extraction and RT-qPCR 
using primers specific for c-Maf, IL-21, IL-10, IFNy, LAG-3 and CD49b. 

To compare levels of IL-10 mRNA in the tetramer* compared with tetramer 

CD4* T cells of pPMHC-NP-treated PBMC-engrafted NSG hosts, we stained sple- 
nocytes with anti-hCD4-FITC, anti-hCD19-APC and tetramer-PE as described 
above, and sorted tetramer* and tetramer’ cells from individual hosts by FACS 
(FACSAria-BD Biosciences). Sorted cells were cultured for 72 h in RPMI- 
1640 containing 10% human AB serum, in the presence of Dynabeads Human 
T-Activator CD3/CD28 (LifeTechnologies) using a 1:1 cell to bead ratio. Total 
RNA from cell pellets was reverse-transcribed using a dual reverse transcriptase/ 
lysis solution containing 5mM DTT, 2U ml! RNAase, 500 mM dNTPs, 10 Um! 
of Superscript reverse transcriptase (Invitrogen, LifeTechnologies), 100 mgml! 
bovine serum albumin, 1% Triton X-100, 25 ng ml! Oligo dT (Invitrogen), 
0.5nM spermidine, and 1X First Strant buffer (Invitrogen) in 20,11 for 60 min at 
50°C and 15 min at 70°C. We then mixed 1 1] of the cDNA reaction volume with 
12.5411 of Power SyBRGreen PCR master mix solution (Applied Biosystem) and 
amplified with a real-time PCR machine (7900HT, Applied Biosystems) using 
the following primers: 3-actin (Forward: 5’-CTGGAACGGTGAAGGTGACA-3’; 
Reverse: 5’‘-AAGGGACTTCCTGTAACAATGCA-3’), IL-10 (Forward: 5’-AA 
GACCCAGACATCAAGGCG-3’; Reverse: 5‘-AATCGATGACAGCGCCGT 
AG-3’). 
Statistical analyses. The sample size values described in the figure legends cor- 
respond to the number of individual mice tested (not replicates) and data shown 
correspond to pooled data from different experiments. Data were compared 
by Student's t-test, Mann-Whitney U-test, chi-square, log-rank (Mantel-Cox), 
Pearson correlation or two-way ANOVA tests. Statistical significance was assumed 
at P< 0.05. 
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Extended Data Figure 1 | Sustained expansion of cognate Trl-like 
CD4* T cells by pMHCII-NP therapy restores normal glucose 
homeostasis in diabetic NOD mice by suppressing antigen presentation 
and the activation of non-cognate autoreactive T cells in the PLNs and 
the progression of insulitis. a, Top left, expansion of cognate CD4* 

T cells by 2.5mi/LA8’-NPs in anti-CD25 mAb-treated NOD Foxp3-eGFP 
mice. Data correspond to 8-week-old mice treated three times a week with 
500g of a depleting anti-CD25 mAb ip. or control anti- HPRN mAbs, 
followed by 10 doses of 2.5mi/IA®’-NPs starting at 10 weeks of age 

(two doses per week; n = 4 mice each). Bottom, the tetramer*CD4* T cells 
from anti-CD25 mAb-treated mice express Tp1 markers. Right, percentage 
of circulating FOXP3+eGFPtCD4* (top) and CD25*CD4" cells (bottom). 
b, Tetramer*CD4* T cells sorted from 2.5mi/IA®’-NP-treated mice 
proliferate and produce IL-10 and, to a lesser extent IFNy in response to 
stimulation with 2.5mi peptide-pulsed DCs (n = 3 mice). c, Representative 
cell surface CD49b and LAG-3 profiles on tetramer*CD4* T cells from 
BDC2.5 NOD Foxp3-eGFP mice compared with tetramer CD4* T cells 
from transgenic or wild-type NOD mice (n= 4). d, Upregulation of 
CD49b and LAG-3 on anti-CD3/anti-CD28 mAb-activated BDC2.5 

CD4* T cells from BDC2.5 NOD Foxp3-eGFP mice in response to 
2.5mi/IA®’—-NP (251g pMHC per ml) versus 2.5mi peptide (10g ml!) 

or 2.5mi/IA®’ monomers (25 ug pMHC per ml). e, Upregulation of eGFP 
(IL-10) in anti-CD3/anti-CD28 mAb-activated BDC2.5 CD4* T cells from 
BDC2.5 NOD 1110°"” mice in response to 2.5mi/IA®’—NP as a function 

of CD49b and LAG-3 expression. f, Expression of eGFP (IL-10) in the 
CD4* T cells of 2.5mi/IA2’-NP-treated NOD I110°'? mice (2 doses per 
week for 5 weeks) as a function of CD49b and LAG-3 expression (left, 
representative profiles; right, eGFP MFI values) (n = 8). g, Proliferation of 
CFSE-labelled 8.3-TCR-transgenic CD8* T cells (IGRP206-214/ NRP-V7- 
specific) in response to 2.5mi/NRP-V7-peptide-pulsed or unpulsed 


Time after glucose challenge (min) 


DCs in the presence of tetramer™ or tetramer? CD4* T cells from 
2.5mi/IA®’-NP-treated mice and in the presence or absence of cytokine- 
blocking mAbs, rat IgG (negative control) or 1-methyl-L-tryptophan 
(1-MT; an IDO inhibitor). Data correspond to average of proliferated cells 
in 3-7 experiments per condition. h, Changes in blood glucose levels of 
spontaneously hyperglycaemic (>11mM) female NOD mice treated with 
2.5mi/IA8’—NP, IGRP4_29/IA®’-NP, IGRP}98-145/LA8’-NP or HEL}4.22/ 
IA8’-NP (n= 6-9 per group), IGRP4_22 peptide or IGRP4_22 peptide-NPs 
(n=9, 4-5 each). Mice received two doses per week until irreversibly 
hyperglycaemic or normoglycaemic for 4 consecutive weeks, at which 
point treatment was withdrawn. i, Incidence and timing of disease relapse 
in hyperglycaemic female NOD mice rendered stably normoglycaemic 
by treatment with 2.5mi/IA8’-NP, IGRP4-39/IA®”-NP or IGRP}3_145/ 
IA8’-NPs upon treatment withdrawal (after 4 consecutive weeks of 
normoglycaemia). Data correspond to responder mice in Fig. 1g. 

j, Post-prandial serum insulin levels in pMHC-NP-treated mice that 
reverted to normoglycaemia until 50 weeks of age (n = 6) versus newly 
diabetic (n = 12) and non-diabetic age-matched untreated controls 

(n= 10). k, Intra-peritoneal glucose tolerance tests (IPGTT) of the 

mice in h. 1, Areas under the curve (AUC) in the IPGTTs shown in k. 

m, IPGTT serum insulin levels corresponding to the mice in k. 

n, Proliferation of CFSE-labelled IGRP396_214-reactive 8.3-CD8* T cells 
in the PLNs compared with MLNs of 2.5mi/IA®”-NP-treated mice 

that reverted to normoglycaemia until 50 weeks of age, non-diabetic 
age-matched untreated controls and newly diabetic mice. Left panels 
show representative FACS profiles. Right panel compares percentages 

of proliferated cells in the PLNs after subtraction of the background 
proliferation values in non-draining MLNs (n = 6-8 mice per group). 

P values were calculated by Mann-Whitney U-test, log-rank 
(Mantel-Cox) test or two-way ANOVA. Data are averages + s.e.m. 
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Extended Data Figure 2 | Nanoparticles coated with different 
T1D-relevant pMHCII complexes expand cognate Tr1-like CD4* 

T cells in vivo to similar extent, regardless of epitope dominance or 
role of the target T-cell specificity in the disease process. a, Percentage 
of tetramertCD4* T cells in the PLN, MLN and bone marrow (BM) of 
2.5mi/IA®’-NP-treated mice that reverted to normoglycaemia until 

50 weeks of age (n = 5-6 mice per lymphoid organ) or relapsed (n = 1-2) 
compared with newly diabetic (n = 5-6) and non-diabetic age-matched 
untreated controls (n = 4-6). b, Percentage of tetramer*CD4* T cells in 
the splenic CD4* T cells of 2.5mi/IA’-NP-treated mice that reverted to 
normoglycaemia until 50 weeks of age or of age-matched non-diabetic 
untreated mice, stained with two T1D-relevant but non-cognate pMHCII 
tetramers (n = 3-4 per group). c, Percentage of tetramer*CD4* T cells in 
blood, spleen, PLN, MLN and bone marrow of IGRP4_»5/IA8”—NP-treated 
mice that reverted to normoglycaemia until 50 weeks of age (n = 5-6 mice 
per lymphoid organ) compared with newly diabetic (n = 5-8) and 
non-diabetic age-matched untreated controls (n = 4-6). d, Percentage of 
tetramer’ CD4* T cells in blood, spleen, PLN, MLN and bone marrow 

of IGRP}o3-145/IA2”-NP-treated mice that reverted to normoglycaemia 


until 50 weeks of age (n = 5-7 mice per lymphoid organ) compared with 
newly diabetic (n = 4-7) and non-diabetic age-matched untreated controls 
(n=5-7). e, Representative IGRP4_72/IA8’, IGRPj2¢-145/IA®” and GPlyg9-99/ 
IA’ tetramer staining profiles for splenic CD4* T cells from IGRP4_2/IA®’— 
NP- and IGRP}2s-14s/IA®”—NP-treated compared with untreated NOD 
mice. f, Percentages of blood CD4* T cells of IGRP4_22/IA®’-NP- or 

IGRP yog_145/IA®’-NP-cured, HEL 4-22/IA8’-NP-treated and age-matched 
non-diabetic untreated mice stained with non-cognate pMHCII tetramers 
(n= 3-7 per group). g, The tetramer*CD4* T cells of mice treated with 
IGRP 19g 145/IA8’-NP (top) and IGRP4.7/IA8’—NP (bottom) proliferate 
and produce IL-10 specifically in response to stimulation with IGRP4_29 

or IGRPj2-145-peptide-pulsed DCs, respectively (n =3 mice each). 

cpm, counts per minute. h, Percentages of IGRP4_»>/IA®’ tetramer*CD4* 
T cells in blood, spleen, PLN, MLN and bone marrow of NOD mice at the 
onset of hyperglycaemia or upon treatment with IGRP4_)/IA®’-NPs, or 
IGRP4_22 peptide or IGRP4_22 peptide-coated nanoparticles (n = 5-9 mice 
per organ). P values were calculated by Mann-Whitney U-test. Data are 
averages + s.e.m. 
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Extended Data Figure 3 | EAE-relevant pMHCII-NPs expand proliferate and produce IL-10 and, to a lesser extent, IFNy in response 
cognate IL-10-secreting Ta1-like CD4* T cells and ameliorate to stimulation with pMOG3s_49 peptide-pulsed DCs. e, Left and middle, 
established clinical and pathological signs of EAE. a, b, Changes representative luxol fast blue (LFB)/H&E cerebellum staining images from 
in the average weights of C57BL/6 mice immunized with pMOG35_55 untreated and treated mice from b showing presence of inflammatory 

and treated with pMOG3s_49/ IA>_NPs or uncoated nanoparticles foci and areas of demyelination (red arrows). Right, average number of 
starting on days 14 (a) or 21 (b) after immunization. c, Percentage of inflammatory foci per section. Data corresponds to 4 untreated and 
PMOG3e_49/IA> tetramer*CD4* T cells in peripheral lymph nodes, 5 treated mice. f, Representative LFB/H&E-stained spinal cord sections 
bone marrow and central nervous system (CNS) of mice from a and b. from mice in b. Data were compared with Mann-Whitney U-test. 

d, The tetramer*CD4°* T cells of pMOG3g_49/ TA>_NP-treated mice Data are averages + s.e.m. 
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Extended Data Figure 4 | EAE- or CIA-relevant pMHCII-NPs expand 
cognate Trl-like CD4* T cells and ameliorate clinical and pathological 
signs of EAE or CIA in HLA-DR4-IE-transgenic C57BL/6 [Ab™" or 
C57BL/10.M mice. a, Changes in the average EAE scores of HLA-DR4-IE- 
transgenic C57BL/6 TAb™!! mice immunized with hPLPj75_192 or 
hMOGo7-108 and treated with hPLP}75_192 /DR4-IE or hMOGo7_10s/DR4- 
IE-NPs or uncoated nanoparticles starting on the day when mice reached 
a score of 1.5 (to synchronize the groups for disease activity) (n = 3-4 

per group). b, Percentage of tetramer*CD4* T cells in spleen, blood, 
cervical and inguinal LNs and CNS of mice from a. Data correspond to 

4 pMHC-NP-treated and 6 control-NP-treated mice. c, Changes in the 
average weights of HLA-DR4-IE-transgenic C57BL/6 [Ab™" mice from a, 
immunized with hPLP}75_192 or hMOGoz_10g and treated with 
hPLP}75_192/DR4-IE-NPs, hMOGov-10s/DR4-IE-NPs or uncoated 
nanoparticles when the mice reached a score of 1.5. d, LFB/H&E staining 
of the cerebellum of HLA-DR4-IE-transgenic C57BL/6 IAb™" mice from 

a showing reductions in inflammation and demyelination in mice treated 
with hPLPj75_192 /DR4-IE or hMOGo7_103/DR4-IE-NPs compared with 
controls. e, Percentage of tetramer*CD4* T cells in lymph nodes and bone 
marrow of the mice in Fig. 2a (C57BL/10.M HLA-DR4-IE mice immunized 
with bovine collagen) at the end of follow-up (10 doses, 5 weeks). 

f, Changes in the average weights of HLA-DR4-IE-transgenic C57BL/6 


IAD" mice immunized with hPLP}7s_192 from Fig. 2f. g, Representative 
LFB/H&E staining of the cerebellum of HLA-DR4-IE-transgenic C57BL/6 
TAb™! mice immunized with hPLP}75_92 and treated with hPLP175_192/DR4- 
IE-NPs, hMOGo7-103/DR4-IE-NPs, hMOGo7-_0s peptide i.v. or s.c. (8 ug 
per dose), hMOGo7-19g/DR4-IE monomer (25 1g per dose), hMOGo7-108 
peptide-NPs (using the molar equivalent of peptide delivered via pMHC- 
NPs; 0.68 jug per dose), or hMOGo7-10s peptide-MPs (15 pg peptide per 
dose) compared with mice left untreated or treated with uncoated NPs 

or MPs (at the same NP/MP number). h, Changes in the average EAE 
scores and body weights of HLA-DR4-IE-transgenic C57BL/6 IAD™" mice 
immunized with hPLPj7s_192 in response to treatment with hMOGo7- 108 
peptide i.v. or s.c. (8 ug per dose’®), hMOGo7_10s/DR4-IE monomer 

(25g per dose), hMOGoz_ 10g peptide-NPs (0.68 .g peptide per dose), 
hMOGo7_10s peptide-MPs (15 jg peptide per dose'’), or a single dose of 
hMOGo7_10s peptide-MPs (15 jug peptide!’) compared with mice 

left untreated or treated with uncoated NPs or MPs (at the same NP/MP 
number) (n = 4-5 per group). The cohort of mice treated with one dose 
had to be terminated after 2.5 weeks, owing to rapid progression of 
disease. i, Percentages of tetramer’ CD4* T cells in spleen, blood, cervical 
and inguinal LNs and bone marrow of mice from h (n = 3-9 per group). 
Data were compared with Mann-Whitney U-test or two-way ANOVA. 
Data are averages + s.e.m. 
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Extended Data Figure 5 | Disease reversal by pMHC-NPs is driven 
by the Tp] cytokines IL-21, IL-10 and TGF-6 and involves several 
downstream cellular targets. a, Changes in blood glucose levels in 
diabetic NOD mice (>11 mM) treated with IGRP4_37/IA®”-NPs and 
blocking anti-IL-10, anti-IFN7 or anti-TGF-8 mAbs or anti- HRPN 
rat-IgG (n= 4-6 per group). b, c, Percentages of tetramer*CD4* T cells 
in the spleens (b), and proliferation of CFSE-labelled 8.3-CD8* T cells 


in the PLNs verus MLN of the mice from Fig. 3a at the end of follow up (c). 


d, Changes in blood glucose in hyperglycaemic NOD, NOD I1107/~ 

and NOD Ifng’/~ mice (n=3—6 per group) in response to 2.5mi/IA®’— 
NPs. e, f, Percentages of tetramer*CD4* T cells in the spleens (e), and 
proliferation of CFSE-labelled 8.3-CD8* T cells in the PLNs versus MLN 
of the mice from d at the end of follow up (f). g, EAE scores of mice 
treated with pMHC-NPs and rat-IgG or blocking mAbs (n = 4 per group). 
h, LFB/H&E staining of the cerebellum of HLA-DR4-IE-transgenic 
C57BL/6 IAb™" mice from g, highlighting differences in inflammation 
and demyelination in mice treated with hPLP}75_192/DR4-IE-NPs and 
rat-IgG versus blocking anti-IL-10, anti-TGF-@ or anti-IL-21R mAbs. 

i, Changes in the average body weights of HLA-DR4-IE-transgenic 
C57BL/6 IAb™" mice from g. j, Percentage of tetramer*CD4* T cells in 
spleen, blood and inguinal LNs of mice from g (n=4 per group). 

k, 1, Changes in the average EAE scores (k) and body weights (1) of 
C57BL/6 I127r~/~ mice immunized with pMOG3;_s5 and treated with 
PMOG3s_49/IA°-NPs or uncoated nanoparticles starting on the day 

when mice reached a score of 1.5 (to synchronize the groups for disease 
activity) (1 =7 and 4, respectively). m, Representative IBA1 and LFB/H&E 
stainings of the cerebellum and the corresponding relative rank scores of 
mice from k (n= 3 and 4, respectively). n, Percentage of tetramer*CD4* 

T cells in spleen, blood, inguinal LNs and bone marrow of mice from k (left), 
and representative CD49b and LAG-3 staining profiles of tetramer* versus 
tetramer” cells (right). 0, Percentage of B220* cells in the PLNs or MLNs 
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of 2.5mi/IA®”—-NP- or HELj4.22/IA®’-NP-treated mice (n = 4 per group). 

p, Correlation between the percentages of PLN and splenic B220* cells and 
2.5mi/IA®’ tetramer*CD4 T cells in additional cohorts of mice treated 
with 2.5mi/IA®’-NPs, over a range of total pMHC dose (0.75-25 1g 

of total pMHC) (n = 24-28). q, Left, in vitro proliferation of CFSE-labelled 
BDC2.5 CD4* T cells against 2.5mi or GPI2g2-292 peptide-pulsed B cells 
purified from the PLNs or MLNs of untreated NOD mice or mice treated 
with 2.5mi/IA®’-NPs (n = 5-6 per group). Right, representative CFSE 
dilution profiles. Briefly, profiles show the extent of CFSE dilution in 
CFSE-labelled BDC2.5 CD4* T cells cultured in the presence of 2.5mi 

or GPlyg2_292 peptide-pulsed B cells purified from the PLNs or MLNs of 
untreated or 2.5mi/IA8’-NP-treated NOD mice. r, PLN-derived B cells 
(10°) from 2.5mi/IA®’—NP-treated mice secrete IL-10 ex vivo in response 
to LPS (1g ml~'). Data correspond to 6 pMHC-treated and 5 untreated 
NOD mice. s, t, Changes in the percentages of 2.5mi (PKH26-labelled) 
compared with GPI489 292 peptide-pulsed (CFSE-labelled) B cells (s) or 
DCs (t) 7 days after transfer (at 1:1 ratio) into untreated or 2.5mi/IA®’- 
NP-treated NOD mice. Histograms show averaged ratios for each cell type 
and condition (n= 3-4 mice per cell type and condition). u, Percentages 
of CD5+CD1d*eGFP*B220* cells in mice treated as in Fig. 3b plus 
blocking Abs (n=4 each). v, LPS-stimulated PLN B cells from NOD mice 
treated with 10 doses of 2.5mi/IA®’—NPs suppress the proliferation of 
CFSE-labelled BDC2.5 CD4* T cells by 2.5mi peptide-pulsed DCs in vitro, 
as compared to LPS-stimulated PLN B cells from untreated controls. 

x, Percentage of CD19*CD3° cells in blood before and after 3 doses of 
250 pg of anti-CD20 mAb (n= 4). y, 2.5mi/IA8’-NP-induced upregulation 
of IL-21 and IL-10 mRNA in memory eGFP~ BDC2.5 CD4* T cells from 
BDC2.5-TCR-transgenic NOD Foxp3-eGFP donors in NOD Thy1* hosts 
(n=5). P values were calculated by Pearson correlation, Mann-Whitney 
U-test or two-way ANOVA. Data are averages = s.e.m. 
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Extended Data Figure 6 | Effects of cytokine blockade or genetic deficiency on the cytokine profile of cognate CD4* T cells expanded by 
2.5mi/LAS’-NPs. n = 3 mice each. Data are averages + s.e.m. 
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Extended Data Figure 7 | Human T1D-relevant pMHCII-NPs, but not Supplementary Table 2. b, Representative FACS profiles (cognate versus 
free peptide or peptide-coated nanoparticles or microparticles,expand _ control tetramer staining in splenic hCD4* T cells) of human healthy 


cognate Tp1-like CD4* T cells in human PBMC-engrafted NSG hosts. control PBMC-engrafted NSG hosts treated with IGRP}3_»5/DR3-NPs (left), 
a, FACS profiles (cognate versus control tetramer staining in hCD4* or human T1D PBMC-engrafted NSG hosts treated with IGRP}3_25 

T cells) of samples from mice identified as responders in Supplementary peptide, IGRP}3_25 peptide-coated nanoparticles, IGRP}3_25 peptide-coated 
Table 2. Numerical data on tetramer™ T cells are presented on microparticles, or left untreated (right). See Fig. 5 legend for details. 
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Extended Data Figure 8 | Schematic of the proposed mode of operation 
of pMHCII-based nanomedicines. pMHCII-coated NPs (pMHC-NP, 
lacking costimulatory molecules) promote the differentiation of disease- 
primed (antigen-experienced) IFN\-producing CD4* Ty1-cells into 
memory Tpl-like CD4* T cells followed by systemic expansion. This 
differentiation process (but not the subsequent expansion) requires both 
IFN+ and IL-10, whereas IL-27 is dispensable. The pMHC-NP-expanded 
(mono-specific) autoreactive Ta1-like CD4* T cells then suppress other 
autoreactive T-cell responses by secreting IL-21, IL-10 and TGF-8, 

which act on local APCs (B cells, CD11c* and CD11b* cells) that have 
captured the cognate autoantigen and thus present cognate pMHCII 
complexes to the expanded Ta1-like cells. This interaction inhibits the 
proinflammatory function of the targeted APCs and blocks their ability to 
present other pMHC class I and class II complexes to non-pMHC-NP- 
cognate autoreactive T-cell specificities (note that the local APCs 


meee 


IFN-y 


oo naive 
eo memory 


activated 


mTR1 $07 cognate B cell 
t ) Breg 
% 


3% = pMHC-NP 
SH cell death 
(=) Target cells 


DC 


antigen-loaded 
APC 


myeloid 
APC 


IL-10 


Cognate 
B cell 


uptake both cognate and non-cognate autoantigens shed into the milieu 
simultaneously). Suppression of antigen-presentation requires IL-10 

and TGF-§ but not IFNy or IL-21. Furthermore, cognate interactions 
between the pMHC-NP-expanded Tal CD4* T cells and autoreactive 

B cells specific for the cognate autoantigen (able to display the cognate 
pMHCII complex on the surface) promotes their differentiation into Breg 
cells in an IL-21-dependent manner, which contribute to promote local 
immunosuppression, likely by secreting IL-10. Suppression of antigen 
presentation selectively targets APCs displaying the cognate pMHC, 

but as local APCs that capture the cognate autoantigen also capture other 
autoantigens simultaneously, the autoregulatory CD4* T cells expanded 
by pMHC-NPs blunt the presentation of other autoantigenic pMHC 
complexes to a broad range of autoreactive T cells. This suppression is 
disease-specific and self-limiting. 
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The dynamic N'-methyladenosine 
methylome in eukaryotic messenger RNA 


Dan Dominissini!?*, Sigrid Nachtergaele!:?*, Sharon Moshitch-Moshkovitz**, Eyal Peer?-+, Nitzan Kol?, Moshe Shay Ben-Haim?“, 
Qing Dai!?, Ayelet Di Segni*, Mali Salmon-Divon?, Wesley C. Clark®, Guanqun Zheng®, Tao Pan®, Oz Solomon*”, Eran Eyal’, 
Vera Hershkovitz*, Dali Han!’, Louis C. Doré!, Ninette Amariglio*°, Gideon Rechavi** & Chuan He!*® 


Gene expression can be regulated post-transcriptionally through dynamic and reversible RNA modifications. A recent 
noteworthy example is N°-methyladenosine (m°A), which affects messenger RNA (mRNA) localization, stability, 
translation and splicing. Here we report on a new mRNA modification, N'!-methyladenosine (m!A), that occurs on 
thousands of different gene transcripts in eukaryotic cells, from yeast to mammals, at an estimated average transcript 
stoichiometry of 20% in humans. Employing newly developed sequencing approaches, we show that m!A is enriched 
around the start codon upstream of the first splice site: it preferentially decorates more structured regions around 
canonical and alternative translation initiation sites, is dynamic in response to physiological conditions, and correlates 
positively with protein production. These unique features are highly conserved in mouse and human cells, strongly 
indicating a functional role for m!A in promoting translation of methylated mRNA. 


Four decades after it was first observed as an abundant constituent of 
the eukaryotic transcriptome, m®A has only recently been revealed to 
broadly affect mRNA metabolism!”. Consequently, a broader notion 
has emerged that dynamic chemical modifications of mRNA, now 
also encompassing pseudouridine (W)*° and 5-methylcytosine®’, 
have vital roles in post-transcriptional gene regulation. With the 
epitranscriptome, as these modifications are now known, RNA joins 
both DNA and proteins in being subject to dynamic and reversible 
chemical control®’. Unlike the latter two, however, the functions of 
RNA chemical adducts are only now being determined. It is likely 
that the complement of mRNA modifications, at present much 
smaller than that found on transfer RNA (tRNA) and ribosomal 
RNA (rRNA)?®, is far from complete. 


Transcriptome-wide mapping of m!A 

m'A was identified in total RNA from several species decades 
ago!)!?_ It was later characterized in tRNA}? and rRNA" where 
it has a major influence on structure and function, owing to the 
presence of both a methyl adduct and a positive charge under 
physiological conditions’*. Since m!A can rearrange to m°A under 
alkaline conditions (Dimroth rearrangement)", its signature in 
mRNA might be erased by the purification and measurement pro- 
cess, simultaneously introducing noise in m®A measurements (Fig. la 
and Extended Data Fig. 1a), as earlier attempts to survey MRNA 
base modifications acknowledged'’. To determine whether m!A is 
present in mRNA and to quantify it, we performed quantitative mass 
spectrometry (LC-MS/MS) of pure mRNA preparations under con- 
trolled conditions (Supplementary Note 1, Extended Data Fig. 1b-d) 
and obtained m'A/A molar ratios in several human and mouse cell 
lines and tissues, both normal and cancerous. Our results unequivo- 
cally show that m!A is present in pure mammalian mRNA, at m'A/A 
ratios ranging from approximately 0.015% to 0.054% in cell lines 
and up to 0.16% in tissues (Figs 1b, 5d and Extended Data Fig. le). 


Metabolic labelling of cells with deuterated methionine (d3-Met) 
results in detection of d3-m'A in mRNA, suggesting that S-adenosyl- 
methionine (SAM) is the methyl donor (Extended Data Fig. 1f). 
Although these ratios are smaller than the m°A/A and W/U ratios 
in the same samples (Fig. 1b), the absolute and fractional levels 
of m'A in mRNA (see stoichiometry below) are sufficiently high 
to exhibit potential regulatory functions of selectively methylated 
transcripts. 

We adapted an antibody-based approach, methylated RNA 
immunoprecipitation sequencing (MeRIP-seq), termed here 
m!A-seq, for transcriptome-wide localization of m!A sites and coupled 
it to an orthogonal chemical method based on Dimroth rearrange- 
ment to obtain high-resolution m!A maps (Supplementary Notes 2, 3; 
Fig. 1c-e and Extended Data Fig. 1g-m). We applied m'A-seq to 
polyA* RNA from HeLa cells in two replicates and determined the 
methylated positions (m'A peaks) using the MACS peak-calling 
algorithm!®. We identified 7,154 peaks (fold change (FC) >2, false dis- 
covery rate (FDR) < 5%) in 4,151 coding and 63 non-coding adequately 
expressed (that is, above the 1st quartile of expressed genes) gene 
transcripts that occurred in both replicates (Fig. 2a, Supplementary 
Tables 1, 4). On average, each methylated gene carries 1.4 peaks, with 
most of these genes (over 70%) methylated only once (Fig. 2b). Ina 
conservative estimate, 35% of adequately expressed genes are methyl- 
ated, attesting to the breadth of the phenomenon. Of note, non-coding 
genes are significantly underrepresented among methylated genes 
(P=1x 10~°, hypergeometric test). Methylated genes constitute a 
progressively larger fraction of genes as expression increases (Fig. 2c), 
and are significantly enriched in several gene ontology terms related to 
translation and RNA processing (Extended Data Fig. 2a, b). 

As most methylated transcripts harbour a single m'A, methylation 
stoichiometry can be estimated based on the change in transcript levels 
following immunodepletion. We used microarrays to determine 
relative transcript abundance in HepG2 RNA before and after depletion 
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Figure 1 | Development of m'A-seq to map a newly identified 
constituent of mammalian mRNA. a, Chemical structures of m!A and 
m°A. Methyl groups (-CHs) are in red and the positive charge (+) on 
m/!A is in blue. b, LC-MS/MS quantitation of m'A, m°A and W in human 
and mouse mRNA isolated from the indicated cell types. The level of 
each modified nucleoside is indicated as a percentage of the unmodified 
one. Mean values + s.e.m. are shown, n = 3. MEFs, mouse embryonic 
fibroblasts; mESC, mouse embryonic stem cells; M3 KO, Mettl3 knockout. 
c, Schematic outline of m'A-seq. RNA is fragmented and subjected to 
immunoprecipitation using anti-m'A antibody. Eluted RNA fragments 
are converted to cDNA and sequenced, or treated to induce partial 
m/A-to-m®°A rearrangement before cDNA synthesis. mA causes 


with anti-m'A antibody, and found that the average methylation level 
in single m!A-containing genes is ~20% (Fig. 2e, Extended Data Fig. 2c, 
Supplementary Table 7). The presence of m'A ina substantial fraction 
of mRNA transcripts suggests a potentially sizeable effect on mRNA 
metabolism. 
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Figure 2 | m!A is associated with translation initiation sites (TISs) 
in the human transcriptome. a, Representative plot of a methylated 
transcript. Normalized sequence coverage of immunoprecipitation 
(m!A IP, red) and input (blue) are indicated above the gene architecture 
in UCSC format. Thin black boxes represent the 5’ and 3’ untranslated 
regions (UTRs), thick black boxes represent the coding sequences 
(CDS) and thin lines represent introns. b, The percentage of methylated 
genes that carry 1, 2, 3 or 4+ peaks per gene in the indicated human 
cell types. Most methylated genes carry only one m‘A peak. c, The 
percentage of methylated genes in HeLa cells increases with expression 
level. d, Pie charts presenting the fraction of m'A peaks in each of three 
non-overlapping transcript segments (5’ UTR, CDS and 3’ UTR) in the 
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(higher mismatch rate) (lower mismatch rate) 


both reverse transcription stops and read-throughs accompanied by 
mismatches, to produce typical peaks with a central trough and an 
adenosine with increased mismatch rate (left). Partial rearrangement 
of m!A to m°A attenuates the effect (right). m’A is in red, m°A is 

in grey; mismatch rate is illustrated as the ratio between A (red) 

and T (black). d, Detection of the known m!A 1322 site (yellow) in 
human 28S rRNA validates m'A-seq (black curve). High fold-change 
in mismatch rate (before and after m'A-to-m°A rearrangement in 
immunoprecipitation, blue lines) independently identifies m'A 1322 with 
single-nucleotide resolution. e, Mismatch rates in m!A 1322 increase 
after immunoprecipitation and decrease upon rearrangement. 


m!A associates with translation initiation sites and the 
first splice site 

We found that m!A peaks appear in all segments of transcripts—5’ 
untranslated region (UTR), coding sequence (CDS) and 3’ UTR—but 
are markedly under-represented in the latter (~fivefold depletion 


c 
m1 peak/gene 13 peaks/gene 30 HeLa 
2 peaks/gene mi 4+ peaks/gene 3 = 
we 
23 20 
rad 
£5 15 
5 = 10 
be 5 
Oo 3 gdm an 
a 1234 5 678 9 10 
HeLa HepG2 — HEK293 Expression decile 
Stop g — All genes — AUG in 2nd exon 
+ — HeLa, miA — AUG in 1st exon — AUG in 3rd+ exon 
+ —HepG2, m'A 2 
‘ —HEK293,m'A_ $12 
+ —HepG2, m°A 2 
e 8 
xe) 
24 
mo 8 
7 2 0 
a -1,000 -500 0 500 1,000 


3° UTR Distance from 1st splice site (nucleotides) 

indicated human cell types. e, Stoichiometry (fraction) of methylated 
transcripts in genes that carry one mA peak in HepG2 mRNA. Plotted 
is the number of genes sorted by percentage of methylated transcripts. 
Dashed lines define the stoichiometry interquartile range (17-23%). 

f, Metagene profiles of m'A peak distribution along a normalized 
transcript composed of three rescaled non-overlapping segments 
illustrated below, in the indicated human cell types. m!A peaks cluster 
around the AUG start codon. For comparison, the distribution of m°A 
peaks in HepG2 is superimposed. g, Metagene profiles of m!A peak 
distribution in a non-normalized window centred on the first splice site. 
Peaks are sorted by the exon containing the AUG codon in that gene. 


© 2016 Macmillan Publishers Limited. All rights reserved 


relative to chance, P< 1 x 107-7, hypergeometric test, Fig. 2d). m!A-seq 
of two additional cell lines in replicates, HepG2 and HEK293, produced 
similar distribution patterns (Fig. 2b, d, Extended Data Fig. 2d—g and 
Supplementary Tables 1, 4). We noted that while input sequence reads 
have a typical distribution along transcripts, m'A immunoprecipitation 
reads pile up towards their 5’ ends (Extended Data Fig. 3a). To investigate 
this further, metagene profiles were generated in which gene segments 
were rescaled proportionally so that all segments of one kind appear to 
have the same length. This revealed that m'A clusters sharply around the 
start codon to produce a remarkably similar profile in the three human 
cell lines (Fig. 2f). Over 50% of the peaks reside within a stretch of 300 
nucleotides centred on the start codon (Extended Data Fig. 3b-g). 

Ribosome profiling’’, and a recently modified version that captures 
initiating 80S ribosomes”*”!, have uncovered a surprising variety of 
alternative translation initiation sites (TISs). We examined our data sets 
from this perspective and found that methylated transcripts have, on 
average, more alternative TISs than non-methylated ones (1.98 versus 
1.59 in HeLa; 2.37 versus 1.69 in HEK293), and that the number of m‘A 
sites per transcript positively correlates with the number of alternative 
TISs per transcript (Extended Data Fig. 4a—c). Furthermore, tran- 
scripts with upstream TISs are more likely to be methylated within the 
5’ UTR, whereas transcripts with downstream TISs tend to be methyl- 
ated within the CDS (Extended Data Fig. 4d, e). The distances of 
alternative TISs and respective m'A peaks from the annotated start 
codon are well correlated, particularly those of upstream TISs and 5’ 
UTR m!A peaks, suggesting that m‘A is associated with alternative 
TISs as well (Extended Data Fig. 4f-h). In summary, m!A in human 
mRNA seems to be associated with a subset of TISs, both canonical and 
alternative, in all three cell lines examined. 

To gain a deeper understanding of the underlying transcript attribute 
or processing event that potentially guides m'A deposition to create 
the observed distribution, we binned methylated transcripts based on 
the exon that harbours the start codon and plotted the distribution of 
mA relative to the start codon. We observed an intriguing dissocia- 
tion: when the start codon is in the 2nd or 3rd exons, m!A tends to 
occur closer to the transcription start site (TSS) compared to when 
the start codon is in the 1st exon (Extended Data Fig. 3h). Whereas 
5’ UTRs of transcripts with a start codon in downstream exons are 
on average longer (323 versus 189 nucleotides), their first splicing 
event tends to occur closer to the TSS (204 versus 359 nucleotides), 
mirroring m!A behaviour. Therefore, we plotted the distribution of 
m!A relative to the TSS in bins based on the length of the 1st exon and 
observed that m'A moves away from the TSS as the length of the first 
exon increases (Extended Data Fig. 3i). These observations suggest 
that neither the TSS nor the start codon ‘anchors’ m!A, but rather the 
first splice site does. We substantiated this by plotting m'A relative to 
the nearest splice site (which is also the first splice site for 85% of m'A 
peaks) and observed convergence of m!A distribution in all start codon 
bins mostly upstream of the splice site, engendering the hypothesis 
that the first splicing reaction somehow guides m!A deposition 
(Fig. 2g, Extended Data Fig. 3j). Analysis based on the smaller high- 
resolution mA trough data set recapitulated all the salient features of 
the methylome (Extended Data Fig. 3k-q, Supplementary Note 3 and 
Supplementary Table 6). 


Conserved features of methylated transcripts 

We next sought to identify sequence patterns that may define or asso- 
ciate with m!A peak locations. GC content, a key determinant of RNA 
structure, is unusually high in m!A peak areas that appear in all three 
transcript segments (Extended Data Fig. 5a). Several motif detection 
algorithms showed enrichment of GC-rich sequences with variable 
adenosine content, yet did not identify any specific motif per se (other 
than the ‘AUG itself which is expected to appear in peaks centred on the 
start codon) (Extended Data Fig. 5b). This suggests that a combination 
of sequence and structure features may specify methylation positions, 
perhaps similar to the case of m'A deposition in tRNA”. We were 
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Figure 3 | m‘A occurs in GC-rich sequence contexts and in genes 

with structured 5’ UTRs. a, Sequence frequency logo for a set of 

192 adenosines in peak areas that have a higher mismatch rate in 
immunoprecipitation relative to input (FC > 6) in HepG2 demonstrates 
the GC-rich context of m!A. b, Length-adjusted minimum free energy 
(aMFE) for 5’ UTRs of methylated, unmethylated and all genes. Box limits 
represent 25th percentile, median and 75th percentile, whiskers represent 
2.5 and 97.5 percentiles. P values are indicated, t-test. Analysis was based 
on human common peaks (see Methods). c-e, A sliding window of mean 
GC content (c), AG (d) and PARS scores (e) in a+150-nucleotide window 
centred on the AUG start codon of methylated genes compared to all the 
rest, in HepG2 cells. P values are indicated; Kolmogorov—Smirnov test, 
t-test and Kolmogorov—Smirnov test, respectively. 


able to identify and score single-nucleotide m'A positions based on the 
change in mismatch rates between (1) untreated and alkaline-treated 
(m!A-to-m°A rearranged) immunoprecipitation sequence reads and 
between (2) input and immunoprecipitation (Supplementary Table 5). 
A sequence frequency logo of identified positions (Fig. 3a) and repre- 
sentative individual sequences (Extended Data Fig. 5c) demonstrate 
that m!A sites are embedded in GC-rich environments. Using the m'A 
trough data set we also identified a GA-rich motif that was present in 
10-15% of the cases (Extended Data Fig. 3r). 

Attributes of the 5’ UTR such as length, GC content, and associated 
minimum free energy (MFE), are known to affect the efficiency of 
translation initiation”. 5’ UTRs of methylated transcripts differ sig- 
nificantly from those of non-methylated ones in that their GC content 
is higher (median of 71% compared to 63.9%) (Extended Data Fig. 5d) 
and their length-adjusted MFE (aMFE) is lower (median of —43.5 kcal 
mol! compared to —38.4kcal mol!) (Fig. 3b, Extended Data Fig. 5e). 
We employed a sliding window approach to further characterize the 
continuous structural landscape of the AUG window (+150 nucleo- 
tides) in terms of GC content, MFE and experimental parallel analysis 
of RNA structure (PARS) score (Fig. 3c—e, Extended Data Fig. 5f, g). 
Both the experimental and calculated parameters agree that m'A 
decorates highly structured AUG windows. 


Evolutionary conservation and dynamics 

Evolutionary conservation typically indicates a common funda- 
mental biological function. We show above that m!A is present in 
murine mRNA at comparable levels to those in human (Fig. 1b). 
Therefore, m'A-seq was applied to RNA of mouse liver, mouse embry- 
onic fibroblasts (MEFs) and mouse embryonic stem cells (mESC) 
(Supplementary Tables 2, 4). A previously unreported single m!A posi- 
tion in mouse 28S rRNA was identified in an orthologous location to the 
known m!A position in human 28S rRNA (Extended Data Fig. 6a—c). 
The mouse m!A methylome closely resembles that of human: approxi- 
mately 15% of adequately expressed genes are methylated, the majority 
of which carry a single m!A (Fig. 4a), and the percentage of methylated 
genes increases with expression level (Fig. 4b). m'A in mouse mRNA 
is non-randomly distributed along transcripts with dense clustering 
around translation initiation sites, mirroring the profile in human 
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Figure 4 | m!A methylome conservation between human and mouse. 
a, The percentage of methylated genes that carry 1, 2, 3 or 4+ peaks per 
gene in the indicated mouse cell types. Most genes carry only one m'A 
peak. b, The percentage of methylated genes in mouse liver cells increases 
with expression level. c, Pie chart presenting the fraction of m'A peaks in 
each of three non-overlapping transcript segments (5’ UTR, CDS and 
3’ UTR) in mouse liver. d, Metagene profiles of m!A peak distribution 
along a normalized transcript composed of three rescaled non-overlapping 
segments illustrated below, in the indicated mouse cell types. m'A peaks 
cluster around the AUG start codon. e, Metagene profiles of mouse mA 
peak distribution in a non-normalized window centred on the nearest 
splice site. Peaks are sorted by the exon containing the AUG codon in that 
gene. f, Length-adjusted minimum free energy (aMFE) for 5’ UTRs of 
methylated, unmethylated and all genes in mouse liver. Box limits 
and whiskers are as indicated in Fig. 3b. P values are indicated, t-test. 
g, Human-mouse m!A conservation expressed as per cent of orthologous 
positions with shared m'A peaks according to their location in the 
transcript. 


(Fig. 4c—d, Extended Data Fig. 6d-I), including the association with 
the nearest splice site (Fig. 4e, Extended Data Fig. 6m). Similarly, m'A 
sites reside within a GC-rich context (Extended Data Fig. 7a, b), and 
the 5’ UTRs of methylated transcripts have a higher GC content anda 
lower aMFE than those of all other (unmethylated) transcripts (Fig. 4f 
and Extended Data Fig. 7c-f). 

We systematically assessed the extent of m'A peak conservation 
between human and mouse and found that it extends to the gene level: 
out of 1,338 identified mouse liver m'A peaks that could be mapped to 
adequately expressed, orthologous positions in human HepG2 cells, 441 
had an m!A peak at the orthologous position, representing 33% overall 
conservation (P< 1.3 x 1073"°, Mann-Whitney U test) (Fig. 4g, Extended 
Data Fig. 7g). The 5’ UTR and a 300-nucleotide window centred on the 
AUG codon exhibit the highest degree of conservation with 44-46% 
conserved peaks, compared to 3-16% in other gene segments (Fig. 4g). 

We applied m!A-seq to RNA of Saccharomyces cerevisiae and 
Schizosaccharomyces pombe to further examine this mark in simple 
eukaryotes, and found it to be present in their mRNA transcriptomes, 
albeit without the characteristic mammalian pattern (Extended Data 
Fig. 8a-g, Supplementary Tables 3, 4). Importantly, upon transfer to 
a nitrogen-source deficient medium the S. pombe methylome exhib- 
ited noticeable changes in the identity of methylated transcripts (some 
transcripts gained m!A while others lost it), providing an example of 
how physiological conditions dynamically shape m'A (Extended Data 
Fig. 8h, i). These results further demonstrate the conservation of mA 
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as a mark in eukaryotic mRNA, although the distribution pattern and 
functions in lower eukaryotes could be different from mammals, as 
also observed for m°A”>”®, 

Next, using quantitative LC-MS/MS we surveyed a range of physiolog- 
ical perturbations that could alter the total m'A level in mRNA of mam- 
malian cells. We found that glucose or amino acid starvation reduced 
mA level by approximately threefold and twofold, respectively, as early 
as 4h; that heat shock increased m'A level by approximately 1.5-fold 
at 4h; and that serum starvation resulted in only a modest change 
within the same time frame (Fig. 5a, Extended Data Fig. 9a, b). Of 
note, m°A levels measured in parallel do not change, or change slightly 
in the opposite direction, suggesting that the response is m!A-specific 
(Fig. 5a). m'A-seq performed after 4h of glucose starvation or heat 
shock uncovered a more complex picture: while the overall number of 
m!A peaks decreased (by 17%) and increased (by 23%), respectively, 
mirroring the LC-MS/MS measurement, still some individual positions 
behaved in the opposite direction (Fig. 5b, c; Extended Data Fig. 9c, d). 

The m!A levels across various tissues obtained from a single mouse 
also showed significant variability, with kidney and brain possessing the 
highest m!A level (about threefold higher than that measured in liver, 
for example) (Fig. 5d). A comparison between tissues from lean (wt/wt) 
and obese mice (ob/ob) revealed that brains of the former contain 75% 
more m!A (Fig. 5d). Insum, m!A in mRNA is a dynamic modification 
in response to certain stress conditions and potentially physiological 
signalling, and its level varies between different tissues. 

SAM-dependent methyltransferases that install m'A on tRNA and 
rRNA are known in eukaryotes”*”’. These or related homologues could 
mediate mRNA m!A methylation. FTO, the first RNA demethylase to 
be discovered, belongs to the AlkB family of proteins, and catalyses 
oxidative demethylation of m°A in mRNA”®. Other members of the 
AIkB family are known to perform oxidative demethylation of m'A 
and m°C lesions in DNA/RNA. In particular, ALKBH3 was shown 
to have a strong preference for single stranded DNA/RNA, and to 
demethylate m'A in tRNA and mRNA in vitro”??”. In line with previous 
biochemical observations, overexpression of wild-type ALKBH3, but 
not of an inactive mutant, reduced m!A levels in mRNA (Extended 
Data Fig. 9e). These results suggest that ALKBH3 and its homologues 
are likely candidates that could reverse m'A in mRNA and contribute 
to its dynamic regulation. 


m!A correlates with elevated translation 
Structure and sequence features of the 5’ UTR and both sides of the 
start codon affect the scanning process and thereby the efficiency of 
translation initiation and early elongation*)*”. We show here that m'A 
is a widespread and conserved post-transcriptional modification that is 
associated with TISs in thousands of mammalian transcripts character- 
ized by structured 5’ UTRs. The precise localization of m'A in an area 
of obvious regulatory importance, coupled with its ability to restructure 
RNA in other cases (such as tRNA*) and to affect RNA-protein 
interactions, points to possible involvement in translation initiation. 
We used published ribosome profiling and proteomic data sets, 
obtained from the same cell lines and tissues used for m'A profiling, to 
assess possible effects of m‘A on translation (see Methods). The transla- 
tion efficiency’? (TE) and ribosome release score** (RRS) of genes with a 
single m'A peak in the start codon window compared to genes devoid of 
m/'A altogether are significantly higher (~1.2-1.4, on average) in both 
human and mouse cells (Extended Data Fig. 10). Furthermore, correla- 
tion of m‘A with protein levels normalized to mRNA abundance reveals 
a similar trend: in all cell types examined, methylated genes tend to have 
higher protein levels than non-methylated genes (Fig. 6a—f). The aver- 
age fold-change in protein levels between methylated and non-methyl- 
ated genes in all cell types—and across all gene expression bins—is ~1.7. 
Considering that the average fraction of methylated transcripts is ~20%, 
the isolated true effect of m'A could be even higher. The contribution of 
m!A remains significant even after accounting for multiple covariates 
that are known to affect translation (ANCOVA, Supplementary Table 8). 
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Figure 5 | m'A in mRNA is a dynamic modification that responds 

to changing physiological and stress conditions, and varies between 
tissues. a, LC-MS/MS quantification of m!A (left, grey) and m°A (right, 
black) in mRNA of untreated and glucose-starved (upper panels) or heat 
shock-treated (lower panels) HepG2 cells, presented as percentage of 
unmodified A. Mean values + s.e.m. are shown, n = 3, **P<0.01, not 
significant (NS) P > 0.05, unpaired t-test. b, Venn diagrams of differential 
and shared m!A peaks in untreated and glucose-starved (left upper panel) 


Discussion 

N'-methylation on RNA adenosine occurs at the Watson-Crick inter- 
face and is expected to affect RNA base pairing*®. In addition to an 
added methyl group, this modification also endows the modified aden- 
osine with a positive charge under physiological conditions (Fig. 1a), 
which could markedly alter RNA structure and protein—-RNA inter- 
actions. The impact of m!A on the structure and function of tRNA is 
well documented, where a single m!A can be essential for correct func- 
tional folding of atRNA molecule®’. A single charge difference has been 
shown to induce ~100-1,000-fold affinity difference of protein-DNA 
interactions*®. We show here that m!A is a modification occurring on 
thousands of mRNA transcripts in several human and mouse cell types, 
as well as on hundreds of transcripts in two yeast species. Applying 
m!A-seq, a method relying on immunoprecipitation of m'A-containing 
RNA fragments, we found a striking enrichment of m!A around the start 
codon in human and mouse mRNA. Taking advantage of the Dimroth 
rearrangement, which converts m!A (mutation prone in reverse 
transcription) to m°A (low mutation rate in reverse transcription*’) 
with alkaline treatment, we confirmed the accumulation of m!A around 
the start codon at high resolution. 

Transcriptome-wide mapping of m!A revealed unique features of 
its distribution that strongly indicate functional roles: (1) m!A resides 
close to both canonical and alternative TISs found mostly upstream 
of the first splice site; (2) m‘A is present in highly structured regions 
around the start codon; (3) m!A positively correlates with translation 
efficiency and protein level; (4) the distribution pattern and m!A peaks 
are highly conserved in all mouse and human cell types examined; and 
(5) m!A is dynamic in response to certain stress, and potentially physio- 
logical, signals, and its level varies across mouse tissues. Together, these 
attributes suggest a positive and dynamic role for m!A in translation 
initiation in mammalian cells. The importance of RNA structure in 
translation initiation is well appreciated**. A positively charged m!A 
may alter the secondary/tertiary structure of mRNA around TISs by 
blocking Watson-Crick base pairing or introducing charge—-charge 
interactions. Alternatively, potential binding proteins may specifi- 
cally recognize m'A and facilitate translation initiation of methylated 


ob/ob wt/wt 


or heat shock-treated (left lower panel) HepG2 cells. The overall change in the 
number of identified m'A peaks is presented as per cent of untreated (right 
upper and lower panels). c, Representative plots of differentially methylated 
transcripts in untreated and glucose starved (upper panel) or heat-shock- 
treated (lower panel) HepG2 cells. Plot format as in Fig. 2a. Yellow boxes frame 
differential peaks. d, LC-MS/MS quantification of m!A in the indicated wt/ 

wt and ob/ob mouse tissues, presented as percentage of unmodified A. Mean 
values + s.e.m. are shown, n= 3, *P<0.05, NS, P> 0.05, one-way ANOVA. 


transcripts in a way analogous to the role of the m°A reader YTHDF1 
in translation enhancement. Either way, it is tempting to speculate 
that m'A can modulate the strength of a particular translation initiation 
site, thereby affecting the balance between alternative and canonical 
initiation sites. 

In contrast to m®A, which is enriched around the stop codon and in 
the 3’ UTR'”, m!A preferentially associates with the start codon region 
in mammalian mRNA. These two marks could complement each other 
in controlling mRNA metabolism and translation. A single m°A mod- 
ification usually induces ~10-fold change in either duplex stability or 
protein-RNA affinity, as would be expected by the presence of an extra 
methyl group**“’. The propensity of m°A to cluster at 3’ UTRs also 
enables a wider range of dynamic tuning than a single m®A site could 
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Figure 6 | mA around the start codon correlates with higher protein 
levels. a-f, Genes methylated in a 300-nucleotide window centred on the 
start codon have higher protein levels than all other genes with similar 
RNA expression levels in HepG2 (a), HEK293 (b), HeLa (c), mESC (d), 
MEFs (e) and mouse liver (f). ANOVA m!A P values against log, protein 
levels and the average fold-change (Avg. FC) of median protein levels 
across all gene expression bins (Start m!A/Non-m!A) are indicated. 
FPKM, fragments per kilobase of transcript per million mapped reads. 
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offer!”. The unique chemical properties of m'A, with both a positive 
charge and a methyl group, potentially allow each individual modified 
site to have a stronger effect than m°A in terms of RNA structure or 
protein-RNA interaction. 

The discovery of this new mRNA mark that positively correlates 
with translation adds an exciting new dimension to post-transcriptional 
gene expression regulation mediated through mRNA modifications. 
While our paper was in proof, we were alerted to a study by Chengqi 
Yi and colleagues reporting similar findings on the m'A methylome 
in a human cell line. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 2 July 2015; accepted 15 January 2016. 
Published online 10 February 2016. 


1. Dominissini, D. et al. Topology of the human and mouse m6A RNA 
methylomes revealed by m6A-seq. Nature 485, 201-206 (2012). 

2. Meyer, K. D. et al. Comprehensive analysis of mRNA methylation reveals 
enrichment in 3’ UTRs and near stop codons. Cel! 149, 1635-1646 (2012). 

3. Carlile, T. M. et a/. Pseudouridine profiling reveals regulated mRNA 
pseudouridylation in yeast and human cells. Nature 515, 143-146 (2014). 

4. Schwartz, S. et al. Transcriptome-wide mapping reveals widespread 
dynamic-regulated pseudouridylation of ncRNA and mRNA. Cel/ 159, 
148-162 (2014). 

5. Li, X. et al. Chemical pulldown reveals dynamic pseudouridylation of the 
mammalian transcriptome. Nature Chem. Biol. 11, 592-597 (2015). 

6. Squires, J. E. et al. Widespread occurrence of 5-methylcytosine in human 
coding and non-coding RNA. Nucleic Acids Res. 40, 5023-5033 (2012). 

7. Khoddami, V. & Cairns, B. R. Identification of direct targets and modified 
bases of RNA cytosine methyltransferases. Nature Biotechnol. 31, 458-464 
(2013). 

8. Fu, Y., Dominissini, D., Rechavi, G. & He, C. Gene expression regulation 
mediated through reversible m®A RNA methylation. Nature Rev. Genet. 15, 
293-306 (2014). 

9. Li, S. & Mason, C. E. The pivotal regulatory landscape of RNA modifications. 
Annu. Rev. Genomics Hum. Genet. 15, 127-150 (2014). 

10. Machnicka, M. A. et al. MODOMICS: a database of RNA modification 
pathways—2013 update. Nucleic Acids Res. 41, D262-D267 (2013). 

11. Dunn, D. B. The occurrence of 1-methyladenine in ribonucleic acid. 

Biochim. Biophys. Acta 46, 198-200 (1961). 

12. Hall, R. H. Method for isolation of 2’-O-methylribonucleotides and 
N1-methyladenosine from ribonucleic acid. Biochim. Biophys. Acta 68, 
278-283 (1963). 

13. El Yacoubi, B., Bailly, M. & de Crécy-Lagard, V. Biosynthesis and function of 
posttranscriptional modifications of transfer RNAs. Annu. Rev. Genet. 46, 
69-95 (2012). 

14. Sharma, S., Watzinger, P., Kotter, P. & Entian, K. D. Identification of a novel 
methyltransferase, Bmt2, responsible for the N-1-methyl-adenosine base 
modification of 25S rRNA in Saccharomyces cerevisiae. Nucleic Acids Res. 41, 
5428-5443 (2013). 

15. Anderson, J. T. & Droogmans, L. in Fine-Tuning of RNA Functions by Modification 
and Editing Vol. 12 (ed. H. Grosjean) 121-139 (2005). 

16. Macon, J. B. & Wolfenden, R. 1-Methyladenosine. Dimroth rearrangement and 
reversible reduction. Biochemistry 7, 3453-3458 (1968). 

17. Desrosiers, R., Friderici, K. & Rottman, F. Identification of methylated 
nucleosides in messenger RNA from Novikoff hepatoma cells. Proc. Nat! Acad. 
Sci. USA 71, 3971-3975 (1974). 

18. Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, 
R137 (2008). 

19. Ingolia, N. T., Lareau, L. F. & Weissman, J. S. Ribosome profiling of mouse 
embryonic stem cells reveals the complexity and dynamics of mammalian 
proteomes. Cell 147, 789-802 (2011). 

20. Lee, S. et a/. Global mapping of translation initiation sites in mammalian cells 
at single-nucleotide resolution. Proc. Nat! Acad. Sci. USA 109, E2424-F2432 
(2012). 

21. Gao, X. et al. Quantitative profiling of initiating ribosomes in vivo. Nature 
Methods 12, 147-153 (2015). 

22. Takuma, H. et al. Substrate tRNA recognition mechanism of eubacterial 
tRNA (m1A58) methyltransferase (Trml). J. Biol. Chem. 290, 5912-5925 
(2015). 

23. Ozanick, S., Krecic, A. Andersland, J. & Anderson, J. T. The bipartite structure of 
the tRNA m1A58 methyltransferase from S. cerevisiae is conserved in humans. 
RNA 11, 1281-1290 (2005). 

24. Kozak, M. Regulation of translation via mRNA structure in prokaryotes and 
eukaryotes. Gene 361, 13-37 (2005). 


446 | NATURE | VOL 530 | 25 FEBRUARY 2016 


25. Clancy, M. J., Shambaugh, M. E., Timpte, C. S. & Bokar, J. A. Induction of 
sporulation in Saccharomyces cerevisiae leads to the formation of 
N6-methyladenosine in mRNA: a potential mechanism for the activity of the 
IME4 gene. Nucleic Acids Res. 30, 4509-4518 (2002). 

26. Schwartz, S. et al. High-resolution mapping reveals a conserved, widespread, 
dynamic mRNA methylation program in yeast meiosis. Ce// 155, 1409-1421 
(2013). 

27. Peifer, C. et al. Yeast Rrp8p, a novel methyltransferase responsible for m!A 
645 base modification of 25S rRNA. Nucleic Acids Res. 41, 1151-1163 (2013). 

28. Jia, G. et al. N6-methyladenosine in nuclear RNA is a major substrate of the 
obesity-associated FTO. Nature Chem. Biol. 7, 885-887 (2011). 

29. Aas, P.A. et al. Human and bacterial oxidative demethylases repair alkylation 
damage in both RNA and DNA. Nature 421, 859-863 (2003). 

30. Ougland, R. et a/. AIkB restores the biological function of mRNA and tRNA 
inactivated by chemical methylation. Mol. Cell 16, 107-116 (2004). 

31. Zur, H. & Tuller, T. New universal rules of eukaryotic translation initiation 
fidelity. PLOS Comput. Biol. 9, €1003136 (2013). 

32. Parsyan, A. et al. mRNA helicases: the tacticians of translational control. 

Nature Rev. Mol. Cell Biol. 12, 235-245 (2011). 

33. Helm, M., Giegé, R. & Florentz, C. A Watson-Crick base-pair-disrupting methyl 

group (m1A9) is sufficient for cloverleaf folding of human mitochondrial 

RNALys. Biochemistry 38, 13338-13346 (1999). 

34. Guttman, M., Russell, P., Ingolia, N. T., Weissman, J. S. & Lander, E. S. Ribosome 

profiling provides evidence that large noncoding RNAs do not encode proteins. 

Cell 154, 240-251 (2013). 

35. Lu, L, Yi, C., Jian, X., Zheng, G. & He, C. Structure determination of DNA 

methylation lesions N!-meA and N?-meC in duplex DNA using a cross-linked 

protein-DNA system. Nucleic Acids Res. 38, 4415-4425 (2010). 

36. He, C. et al. A methylation-dependent electrostatic switch controls DNA repair 

and transcriptional activation by E. coli Ada. Mol. Cell 20, 117-129 (2005). 

37. Harcourt, E. M., Ehnrenschwender, T., Batista, P. J.. Chang, H. Y. & Kool, E. T. 

dentification of a selective polymerase enables detection of N®&- 
methyladenosine in RNA. J. Am. Chem. Soc. 135, 19079-19082 (2013). 

38. Sonenberg, N. & Hinnebusch, A. G. Regulation of translation initiation in 
eukaryotes: mechanisms and biological targets. Cel! 136, 731-745 (2009). 

39. Wang, X. et al, N°-methyladenosine modulates messenger RNA translation 
efficiency. Cell 161, 1388-1399 (2015). 

40. Kierzek, E. & Kierzek, R. The thermodynamic stability of RNA duplexes and 
hairpins containing N®°-alkyladenosines and 2-methylthio-N°-alkyladenosines. 
Nucleic Acids Res. 31, 4472-4480 (2003). 

41. Roost, C. et al. Structure and thermodynamics of N°-methyladenosine in 
RNA: a spring-loaded base modification. J. Am. Chem. Soc. 137, 2107-2115 
(2015). 

42. Li, X. et al. Transcriptome-wide mapping reveals reversible and dynamic 
N'-methyladenosine methylome. Nat. Chem. Biol. http://dx.doi.org/10.1038/ 
nchembio.2040 (2016). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements The work was supported by the National Institutes of 
Health HGO08688 and GM71440 grants to C.H., GM113194 grant to T.P. and 
C.H. and grants from the Flight Attendant Medical Research Institute (FAMRI), 
Israel Science Foundation (ISF grant no. 1667/12), Israeli Centers of Excellence 
(I-CORE) Program (ISF grants no. 41/11 and no. 1796/12), Ernest and Bonnie 
Beutler Research Program and Kahn Family Foundation to G.R. A part of this 
work was funded by the Chicago Biomedical Consortium with support from 
the Searle Funds at The Chicago Community Trust (to D.D. and C.H.). C.H. is an 
investigator of the Howard Hughes Medical Institute (HHMI). G.R. is a member 
of the Sagol Neuroscience Network and holds the Djerassi Chair for Oncology 

at the Sackler Faculty of Medicine, Tel-Aviv University, Israel. D.D. is supported 
by a Human Frontier Science Program (HFSP) long-term fellowship. S.N. is an 
HHMI Fellow of the Damon Runyon Cancer Research Foundation (DRG-2215- 
15), previously supported by a Yen post-doctoral fellowship in interdisciplinary 
research. Q.D. is supported by the National Institutes of Health grant HGO06699. 
We wish to thank S. Farage-Barhom, K. Cesarkas and E. Glick-Saar for help with 
deep sequencing. 


Author Contributions D.D. and C.H. conceived the idea. D.D. designed the 
experiments and analyses, and developed the method. D.D., S.N., and S.M.-M. 
performed experiments. A.D.S. performed PARS structural mapping. V.H., Q.D., 
and G.Z. helped with experiments. Q.D. synthesized deuterated standards. D.D., 
S.M.-M., S.N., M.S.B.-H., E.P.,, N.K., M.S.-D., O.S., E.E., W.C.C. and D.H. performed 
analyses and interpreted findings. D.D., S.M.-M., S.N. and C.H. wrote the 
manuscript with input from M.S.B.-H., E.P., L-C.D., W.C.C., G.Z., T.P., N.A. and G.R. 


Author Information Data have been deposited in the NCBI Gene Expression 
Omnibus (GEO) and are accessible through GEO series accession number 
GSE70485. Reprints and permissions information is available at www.nature. 
com/reprints. The authors declare no competing financial interests. Readers are 
welcome to comment on the online version of the paper. Correspondence and 
requests for materials should be addressed to D.D. (dandominissini@gmail.com), 
GR. (gidi.rechavi@sheba.health.gov.il) or C.H. (chuanhe@uchicago.edu). 


© 2016 Macmillan Publishers Limited. All rights reserved 


METHODS 

Cells and tissues. Human HeLa (cervical adenocarcinoma), HepG2 
(hepatocellular carcinoma) and HEK293 (embryonic kidney) cell lines (ATCC) 
and primary mouse embryonic fibroblasts (MEFs) (C57BL/6, ATCC) were main- 
tained in DMEM (Thermo Fisher Scientific) containing 25 mM glucose, 4mM 
L-glutamine, supplemented with 100 U ml! penicillin, 100,.g ml“! streptomycin 
and 10% FBS. Mouse embryonic stem cells (mESCs) were maintained in FBS-free 
N2B27-based media, as previously described’. Cells were routinely checked for 
mycoplasma contamination. Mouse tissues were obtained from wild-type and 
ob/ob C57BL/6 mice. For metabolic labelling, methionine-devoid DMEM was 
supplemented with L-methionine-(methyl-d3) (Sigma-Aldrich). 

Cell treatments. HepG2 cells were maintained as described above and subjected 
to the following growth conditions: (1) heat shock (43°C for 4h); (2) glucose 
starvation (glucose-depleted medium for 4h); (3) amino acid starvation (amino 
acid-depleted medium, with or without 200 1M S-adenosyl-methionine (SAM) 
supplementation, for 4h); and (4) serum starvation (serum-depleted medium 
for 4h), following which they were immediately harvested in Cell Lysis Solution 
(5 Prime). 

Yeast strains and growth conditions. Wild-type (BY4741) S. cerevisiae cells were 
grown vegetatively in YPD medium to mid-log phase and harvested. Wild-type 
heterothallic (Sp1) S. pombe cells were grown vegetatively in YES medium to mid- 
log phase and harvested. For induction of sporulation/arrest, Sp1 cells were washed 
and transferred to Edinburgh Minimal Medium lacking NH4Cl (EMM-N) for 4h 
and harvested. 

RNA purification. Total RNA from cells in culture and mouse tissues was purified 
using PerfectPure RNA Cultured Cell Kit (5 Prime) and PerfectPure RNA Tissue 
Kit (5 Prime), respectively, and DNase-treated. Enrichment of polyadenylated RNA 
(polyA * RNA) from total RNA was carried out using one round of GenElute 
mRNA Miniprep Kit (Sigma-Aldrich). Ribo-Zero Gold rRNA Removal Kit 
(Ilumina) was used to deplete rRNA from polyA* RNA before LC-MS/MS. 

Dot blot assays. RNA Oligonucleotides were synthesized in-house with either 
m!A, m°A or A ata single internal position (5' -AC(m!A/m°A/A)UG-3’), spotted 
onto a nylon membrane (GE Healthcare) in decreasing amounts (1,000, 200, 40 
and 8 pmol) and UV-crosslinked. Membranes were blocked with 5% non-fat dry 
milk in 1 x PBST (blocking buffer) for 1h at 25°C, and incubated overnight with 
either mouse anti-m!A antibody (11g ml~!, MBL) or rabbit anti-m°A antibody 
(1pg ml~!, Synaptic Systems) in 1 x PBST at 4°C. Following extensive washing 
with 1 x PBST, membranes were incubated with either HRP-conjugated goat 
anti-mouse IgG or anti-rabbit IgG antibody (1:2,500, Thermo Fisher Scientific) 
in blocking buffer for 1h at 25°C. Membranes were washed in 1 x PBST, devel- 
oped with ECL substrate (Thermo Fisher Scientific) and imaged with FluorChem 
imager (Protein Simple) or X-ray film. Competitive dot blots were performed on 
separate membranes spotted with 75 pmol of the m!A-containing oligonucleotide 
by co-incubation of anti-m'A antibody with increasing concentrations of either 
m!A or m°A competitor mononucleoside (0, 1, 2 and 41M). 

m!A detection and quantitation. Purified RNA (see ‘RNA purification’ section 
above) was subjected to liquid chromatography-tandem mass spectrometry 
(LC-MS/MS) for detection and accurate quantitation of m!A, essentially as pre- 
viously described“. 200-400 ng of purified RNA was digested by P1 nuclease 
(Wako, 2 U) in 401 of buffer containing 25mM NaCl and 2.5mM ZnCl for 
2h at 37°C. Subsequently, 5 units (1,11) of Antarctic Phosphatase (New England 
Biolabs) and 1 x Antarctic Phosphatase reaction buffer were added and the sam- 
ple was incubated for another 2h at 37°C. The sample was then filtered (0.22 1m, 
Millipore) and injected into a C18 reverse phase column coupled on-line to Agilent 
6410 QQQ triple-quadrupole LC mass spectrometer in positive electrospray ion- 
ization mode. Quantitation was performed based on nucleoside-to-base ion tran- 
sitions (268-to-136 for A; 282-to-150 for m°A and m!A (retention times of 2.5 
and 0.9 min, respectively); 285-to-153 for d;-m!A and d3-m°A; 245-to-179 for U; 
245-to-113 for U) using standard curves of pure nucleosides and stable isotope- 
labelled internal standards (d3-m!A and d3-m°A), as previously described*>~*”. 
Synthesis of stable isotope-labelled internal standards. d;-m!A was synthesized 
following a previously described procedure for the synthesis of m!A*, replacing 
CHs3I with CD3I. d3-m°A was synthesized following a previously described proce- 
dure for the synthesis of m°A**’, replacing CH;NH) with CD;NH). 

m!A-seq. Mapping of m!A in total or mRNA was performed using m'A-seq, which 
is based on the previously described m°A-seq protocol®! with the following modi- 
fications: RNA fragmentation was performed using RNA Fragmentation Reagents 
(Thermo Fisher Scientific) for 15 min at 70°C to minimize m!A-to-m°A rear- 
rangement followed by column purification (RNA Clean & Concentrator, Zymo). 
Anti-m'A antibody (MBL) was pre-coupled to Protein G Dynabeads (Thermo 
Fisher Scientific) and used to immunoprecipitate methylated RNA fragments for 
3h at 4°C. Fragment elution was carried out by either digestion with Proteinase K 
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(Sigma-Aldrich; 5 units in 5mM Tris pH 7.5, 1mM EDTA and 0.25% SDS for 
1.5h at 37°C) followed by TRIzol (Thermo Fisher Scientific) extraction of the 
supernatant and ethanol precipitation, or by competitive elution using m!A mono- 
nucleoside (Santa Cruz Biotechnology), as in m°A-seq. Induced m!A-to-m°A 
rearrangement was achieved by incubating the input and immunoprecipitation 
fragments in an alkaline buffer (50 mM NazCO3, 2mM EDTA, pH 10.4) for Lh at 
50°C or 60°C followed by column purification (RNA Clean & Concentrator, Zymo) 
(Extended Data Fig. 1m). Immunoprecipitated RNA fragments and comparable 
amounts of input were subjected to first-strand cDNA synthesis using the NEBNext 
Ultra RNA Library Prep Kit for Iumina (New England Biolabs). Sequencing was 
carried out on Illumina HiSeq2500 according to the manufacturer’s instructions, 
using 10 pM template per sample for cluster generation, TruSeq SR Cluster kit v3 
(Illumina), TruSeq SBS Kit v3-HS (Illumina) and TruSeq Multiplex Sequencing 
primer kit (Iumina). Summary of read numbers for each replicate can be found 
in Supplementary Table 4. 

Validation of m!A-seq. Reads were aligned to the 28S ribosomal RNA sequences 
of human (RefSeq NR_003287.2), mouse (RefSeq NR_003279/NCBI X00525) 
and S. cerevisiae (SGD RDN25-2, ID: 000006485.) using Bowtie2” with local 
alignment option (--local). Enrichment was calculated as the coverage ratio of 
immunoprecipitation to input. Mismatch rate was assessed using mpileup tool of 
the SAMtools software package (version 0.1.18)°? for calling variants over the entire 
28S transcript. The frequency of non-adenosine bases at each adenosine position 
and the coverage of each base in the transcript were reported. 

RNA expression level. Reads per kilobase of transcript per million mapped reads 
(RPKM) expression levels of RefSeq genes were calculated using HTseq-count™ 
and R package edgeR®. Fragments per kilobase of transcript per million mapped 
reads (FPKM) values were calculated by the CUFFLINKS tool (version 2.2.1)°°. 
Only genes whose expression level was above the first quartile were considered 
adequately expressed and used for downstream analyses. Adequately expressed 
genes (RPKM) were divided into 10 expression bins for correlation with methyla- 
tion and the fraction of methylated genes in each bin was calculated. 

Peak calling. Adaptors and low quality bases were trimmed from raw sequencing 
reads using cutadapt®”. Reads were aligned to the relevant genome (human-hg19, 
mouse-mm10, S. pombe- ASM294v2.29, and S. cerevisiae- S288C with UTR data 
downloaded from http://genie.weizmann.ac.il/pubs/PARS10/pars10_catalogs. 
html) using Tophat2 (version 2.0.12)°8. Peaks enriched in immunoprecipitation 
over input experiments were identified using MACS2 (version 2.1.0.20140616)'. 
MACS2-identified peaks were intersected with a database of exons of the rele- 
vant genome (RefSeq annotation). Peaks were allocated to the feature containing 
the segment with which they share the largest overlap. Peaks falling in intergenic 
sequences or having an overlap shorter than 50 nucleotides were excluded from 
further analyses. For each cell type only peaks identified (FC > 2 or FC > 4, as 
indicated, FDR < 0.05) in replicates were considered. Common human peaks were 
defined as peaks independently identified in both HeLa and HepG2 RNA. Negative 
peaks were identified by switching the immunoprecipitation and input samples. 
Coverage analysis. The coverage of unique reads at each nucleotide position at the 
transcriptomic level, in immunoprecipitation and input reads, was calculated and 
reported. Only transcripts in which at least one nucleotide exceeded a minimum 
depth of 10 reads in immunoprecipitation were used in further analysis for reduc- 
tion of background noise. For whole-transcript coverage plots, each transcript was 
divided into 100 bins of equal length, and the median coverage for each bin across 
all transcripts was calculated. 

mA stoichiometry. HepG2 mRNA was subjected to immunoprecipitation with 
anti-m!A antibody in 2 biological replicates. Input and immunodepleted (unbound 
sup) RNA samples (75 ng each) were hybridized to PrimeView human gene expres- 
sion microarrays (Affymetrix) and expression levels were determined. Genes 
expressed below the first quartile (as determined by mRNA-seq, see above) were 
set as the minimal intensity limit (blank), and only genes with intensities above 
this threshold were considered. To correct for technical loss of RNA during the 
different steps of the procedure, the average ratio of sup/input intensity was calcu- 
lated for never-methylated genes (considering only genes that were unmethylated 
in all m'A-seq experiments) and used for ratio correction of methylated genes 
carrying one m‘A peak. Fractional methylation level per gene was calculated 
as 1 — (corrected sup/input ratio). 

Identification of m!A-induced mismatches in m'A peaks. Identification of 
sequence variants in m!A peaks due to misincorporation at the mA position dur- 
ing reverse transcription was carried out by determining the base composition 
at each position within peak regions using bam-readcount (https://github.com/ 
genome/bam-readcount). Identified adenosine positions (according to the encod- 
ing DNA strand) were then filtered to exclude known genomic polymorphism sites 
(dbSNP version 141) and identified A-to-I editing sites (Radar database). Variants 
were considered if the mismatch rate was greater than 0.1 and the overall coverage 
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was greater than 20 reads. For HepG2 samples the fold enrichment of mismatch 
rate in untreated over rearranged sequence reads was calculated. 

Trough identification. To identify points of lower coverage inside peak areas, we 
converted the bam files of the immunoprecipitation libraries to BigWig files and 
subjected them to PeakSplitter°? with options of —valley 0.65 and -cutoff 30. The 
trough was identified as the minimal point +10 nucleotides, between the summits 
of identified sub-peaks. Only troughs reoccurring in 3 biological repeats were 
considered. 

Bidirectional conversion of genomic and transcriptomic coordinates. Genomic 
coordinates of all 5’ UTRs, CDS and 3’ UTRs of RefSeq transcripts were down- 
loaded from the University of California, Santa Cruz genome browser (UCSC) 
table browser in BED format (hg19 and mm10 for human and mouse, respec- 
tively). Transcripts with IDs corresponding to more than one genomic location 
were discarded, as were transcripts of non-coding genes. Custom, in-house, awk 
and perl scripts were then used to convert transcriptomic coordinates to genomic 
coordinates and vice versa. 

Definition of peak middle point. MACS2-generated genomic m!A peak coordi- 
nates were converted to transcriptomic coordinates. Neighbouring peaks (at a dis- 
tance of up to 10 nucleotides) on the same transcript were merged using mergeBed 
from the BEDTools package®. This allowed the unification of peaks overlapping 
more than one exon that were divided by MACS2 due to the presence of introns in 
genomic coordinates. The middle of the resulting transcriptomic peak was defined 
as peak middle and referred to as m!A peak position in further analyses. 
Metagene profiling. The relative position of each peak’s middle point was located 
in the corresponding transcript segment (5’ UTR, CDS or 3’ UTR). Each segment 
was assigned a value corresponding to its average length fraction out of the over- 
all transcript length. The corresponding relative peak position was then deduced 
within each segment and plotted along a normalized metagene profile with respect 
to the TSS, AUG start codon and stop codon positions. The distribution of m'A 
peaks with respect to the canonical AUG start codon, TSS, first and nearest splice 
sites, and stop codon was also drawn in actual nucleotide distances. 

Sliding window analysis of AUG start codon window. Structural features of a 
300-nucleotide window centred on the AUG start codon were compared between 
methylated and non-methylated RefSeq genes. GC content was calculated as the 
average percent in a sliding window of 3 nucleotides. Local free energy (AG) was cal- 
culated in a 30-nucleotide consecutive jumping window (in steps of 10 nucleotides) 
using the RNAfold tool in the ViennaRNA package“! with default parameter settings. 
Minimum free energy (MFE) calculation. The 5’ UTR sequence of the predom- 
inant isoform of each gene (the highest expressed isoform in the input sample) 
was used to calculate GC content, MFE, length-adjusted MFE (aMFE) and MFE 
density (MFEden) using the ViennaRNA package"! and as previously described”. 
PARS-seq and PARS scores calculations. Parallel analysis of RNA structure 
(PARS) was performed as previously described® on HepG2 polyAt RNA. Reads 
were mapped to the human transcriptome (UCSC known canonical genes) using 
Bowtie2 with local alignment option (--local). Only uniquely aligned reads were 
used for analysis. Read start positions were counted for each base in the transcrip- 
tome, as the read start is +1 to the nuclease cleavage site and therefore informative 
regarding local RNA 2D structure™. Only reads with more than 5 consecutive bases 
matching the transcript at the start of the read were counted. Read counts were 
normalized to the respective library size. PARS-scores were then calculated based 
on the following equation: 


PARS-score(i,j) = log,(V1'y / S1’j) 


V1’, and S1’; are the normalized read-start counts at base (i) in transcript (j) for 
V1 or S1, respectively. 

PARS scores at the region around the start-codon (+150 nucleotides from the 
AUG) were averaged and used for a metagene analysis. 
Motif search. To search for a motif, 101-nucleotide sequences in the AUG start 
codon window, centred on m'A peaks in HeLa and mouse liver, were used. 
Compatible background sequences were used to prevent skewing of the results 
due to over-representation of known motifs in the region of the start codon, such 
as Kozak sequences and the AUG. The findMotifs program from the HOMER 
package® was then run using the settings -rna -len 6,7,8,9,10 -noweight. The same 
peak areas were used with MEME® for de novo motif enrichment analysis, with 
the settings -dna -minw 6 -maxw 10 -evt 0.01 -maxsize 1,000,000. A similar ana- 
lysis was performed on m!A peaks falling in the CDS with background sequences 
randomly generated from non-methylated exons. 199 sequences of 20 nucleotides, 
surrounding the mutations identified above, were used to create a frequency plot 
of the nucleotides surrounding the mutation sites using WebLogo™. 
Comparison of m°A and m!A profiles. m!A and m°A peaks! were intersected 
using BEDTools. Similarity between sets was also evaluated by comparing the 
metagene profiles of the two modifications. 


TIS correlation. TISs data was downloaded from http://www.ncbi.nlm.nih.gov/ 
sra?term=SRA160745 (ref. 21) and genomic locations were converted to transcrip- 
tomic locations (as described above). For each alternative TIS, the closest m'A peak 
was identified and the distance of both from the canonical AUG start codon was 
calculated. Correlation of alternative TISs with the number of m'A peaks per gene 
was carried out by dividing the genes containing alterative TISs to bins according 
to the number of peaks per gene. 

Gene ontology (GO) enrichment. Methylated gene RefSeq IDs were uploaded to 
DAVID Bioinformatics Resources (http://david.abcc.ncifcrf.gov) and functional 
enrichment analysis was performed using all adequately expressed genes as back- 
ground. Resulting enriched GO terms were restricted to fold enrichment > 1.5, 
Bonferroni corrected P< 0.005. 

Human-mouse conservation. m!A mouse liver peaks were converted to the 
homologous coordinates in the human hg19 genome, using the LiftOver tool of the 
UCSC genome browser™. To assess the significance of these results we chose at ran- 
dom an equal number of 200 bp sized areas from within mouse genes that have a 
human orthologue. The areas were chosen from all gene segments in the same pro- 
portion as observed in the original data set. Randomly chosen areas were similarly 
converted and intersected with HepG2 m!A peaks. To calculate the statistical sig- 
nificance the randomization was repeated 1,000 times and Mann-Whitney U test 
was used. 

ALKBHS3 overexpression. Plasmids encoding wild-type and mutant (D193A) 
FLAG-tagged human ALKBH3 were transfected into 293 Freestyle cells using pol- 
yethylenimine (PEI; Polysciences, Inc.) at a ratio of 3 jl PEI per 11g plasmid DNA. 
After 24h, cells were pelleted and washed once with cold PBS before being lysed 
in Cell Lysis Solution (5 Prime). mRNA was purified and digested for LC-MS/MS 
analysis as described in the ‘RNA purification’ and ‘m!A detection and quantitation’ 
sections above. 

Translation efficiency (TE) and ribosome release score (RRS). Sequencing data 
was downloaded: for HEK293, mouse liver and MEFs - SRA160745”, http://www. 
ncbi.nlm.nih.gov/sra?term=SRA160745; for mESCs - GSE30839"%, http://www. 
ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE30839. Reads were aligned and 
FPKM values were calculated (see above). For TE calculation, the Ribo-Seq FPKM 
of the most highly expressed transcript of each gene was divided by the mRNA-Seq 
FPKM of the same transcript. Only transcripts with FPKM and Ribo-Seq FPKM 
above first quartile were considered. TE data for mESCs was downloaded as-is 
from the original paper!”. TE and RRS for HeLa were calculated using our pub- 
lished data’, RRS values were calculated using mRNA-Seq and Ribo-Seq reads as 
described previously**. 

Protein abundance. All protein abundance data analysed in this study was 
downloaded from the MOPED database (Multi-Omics Profiling Expression 
Data, https://www.proteinspire.org): HeLa, HepG2, HEK293 (Experiment ID: 
Geiger_MCP_2012)”, and mouse liver (Experiment ID: Baylor_mouse_liver_ 
profiling), or from Supplementary Data (mESCs’! and MEFs”, corrected as 
described’). For all cell types, excluding mESCs, we used the normalized 
abundances as provided. For mESCs, raw intensities were divided by each leading 
protein molecular weight. Protein abundance data was then log, transformed and 
subjected to an analysis of variance (ANOVA) with mRNA expression (FPKM) 
percentile bins for start m!A and non-m!A genes. For MEFs, the published mRNA 
expression data was used along with the corresponding protein abundance. Genes 
with mRNA expression below the lowest expressed gene containing m’A or with 
FPKM < 0.2 were discarded. 

Analysis of covariance (ANCOVA). ANCOVA was calculated for genes 
methylated at the AUG start codon window (start m!A) versus unmethylated genes 
(non-m!A) as a fixed factor against log protein abundances, while cleaning the 
contribution of the following covariates: logy mRNA expression, TIS Kozak score‘, 
MFE calculated by RNAfold and divided by 5’ UTR length, G-quadruplex motif 
in 5’ UTR”, 5’ TOP motif in 5’ UTR from UTRdb (http://utrdb.ba.itb.cnr.it/), 
number of conserved miR sites (PCT score < 0) from TargetScan’’, and number 
of u/dTISs and u/dAUGs in human and mouse form QTI-seq”!. The remaining 
factors, % GC Content in 5’ UTR, length of 5’ and 3’ UTRs, and number of exons in 
5! and 3’ UTRs were downloaded and calculated from the UCSC genome browser 
(hg19 and mm10). We considered the most expressed RefSeq genes in our data 
for unique transcript-associated parameters and gene symbols for general features. 
Variables were examined for linearity with protein output and for normal distribu- 
tion. Log transformations were employed where appropriate. Tolerance was kept 
above 0.2 for all variables. 
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Extended Data Figure 1 | See next page for figure caption. 


© 2016 Macmillan Publishers Limited. All rights reserved 


60 °C, pH 10.4, 3 hr 
17% m'A, 83% meA 


Extended Data Figure 1 | Experimental conditions for detection, 
quantitation and sequencing of m'A in mRNA. a, Schematic illustration 
of m!A-to-m°A rearrangement (Dimroth rearrangement) under alkaline 
conditions at elevated temperatures. b, mRNA purification scheme before 
LC-MS/MS m!A quantitation (left). Corresponding RNA electrophoresis 
profiles obtained by Agilent 2100 Bioanalyzer (middle and right). 

c, Monitoring m’A-to-m°A rearrangement levels during sample 
preparation for LC-MS/MS. Synthetic 5-nucleotide long RNA 
oligonucleotides containing m!A (upper panel) or m°A (lower panel) 
were digested to mononucleotides, dephosphorylated (see methods) 

and analysed by HPLC-UV. Only minimal (<10%) m'A-to-m°A 
rearrangement was observed (arrow). d, Standard curves for m!A and 
deuterium-labelled m!A (d3-m'A) demonstrate very similar detection 
sensitivity in LC-MS/MS. Mean values + s.e.m. are shown, n= 3. e, The 
change in m'A/A and m°A/A molar ratios (%) during the purification 
scheme outlined in b. Mean values + s.e.m. are shown, n= 3. f, LC-MS/MS 
of mRNA isolated from HepG2 cells labelled with deuterated methionine 
(d3-Met) for 24 h detects d3-m'A, suggesting S-adenosyl-methionine 
(SAM) is the methyl donor. Mean values + s.e.m. are shown, n= 3. 

g, Dot blots demonstrating high anti-m'A antibody specificity. Increasing 
amounts (indicated above the top blot) of synthetic RNA oligonucleotides 
containing m’A, m°A or unmodified A residues were spotted onto a 
membrane and probed with either anti-m‘A or anti-m°A antibodies. 
Anti-m!A antibody detects m'A and does not exhibit cross-reactivity 
with m°A or A (upper blot); anti-m°A antibody demonstrates low, yet 
detectable, cross-reactivity with both m!A and A (lower blot). For blot 
source data, see Supplementary Fig. 1. h, Competitive dot blots were 
performed on separate membranes spotted with 75 pmol of synthetic 
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m'A-containing RNA oligonucleotide. Whereas increasing concentrations 
of free m'A mononucleoside progressively attenuate anti-m'A binding, 
increasing concentrations of free m°A mononucleoside do not. For blot 
source data, see Supplementary Fig. 1. i, Quantitative LC-MS/MS 
demonstrates m'A enrichment—and m°A depletion—following 
immunoprecipitation (IP) with anti-m'A antibody compared to total RNA 
input. Mean values + s.e.m. are shown, 1 = 3. j, Monitoring m'A-to-m°A 
rearrangement levels under different RNA fragmentation conditions for 
use in m'A-seq. Pure m'A mononucleoside in 1 x fragmentation buffer 
(see Methods) was subjected to the conditions specified to the right of 
the chromatograms and directly analysed by injection to HPLC-UV 

for rearrangement to m°A. k, Comparison of competitive m’A elution 
and Proteinase K elution of immunoprecipitated m!A-containing RNA 
fragments from anti-m'A-coupled magnetic beads shows that the two 
elution modes are equivalent. I, Identification of the known m/A site in 
position 1322 of human 28S rRNA validates the accuracy of m!A-seq 
and the use of peak middle points as a close approximation for m!A 

sites. Partial m'A-to-m°A rearrangement increases the coverage around 
this site. m, Conditions for induced m'A-to-m°A rearrangement of 

RNA oligonucleotides that maintain RNA integrity for use in m'A-seq. 

A synthetic 5-nucleotide long RNA oligonucleotide of the sequence 
5’-AC(m!A)UG-3! was subjected to various base/heating conditions 
(indicated to the right of the chromatograms) and directly analysed 

by injection to HPLC-UV for rearrangement to 5/-AC(m°A)UG-3’. 
Incubation at pH 10.4, 60°C for 1h results in rearrangement to m°A in 
40% of oligonucleotides; longer incubation times result in increased rates 
of rearrangement. Chromatograms of untreated RNA oligonucleotides 
appear above and mark the expected retention times. 
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Extended Data Figure 2 | Characterization of human m!A-methylated maps showing interacting proteins and hyperlinks to gene entries 

genes. a, Gene ontology (GO) analysis of methylated HeLa genes that can be reached through http://www.genome.jp/kegg-bin/show_ 
relative to all adequately expressed genes (above the 1st quartile) reveals pathway? hsa03010. c, The average fraction of methylated transcripts 
enrichment of biological processes related to translation and RNA (stoichiometry) increases with gene expression level. r and P values are 
metabolism. Fold-enrichment and P values are indicated for each category. _ indicated, Pearson correlation. d, Pie chart presenting the fraction of m'A 
b, GO analysis of molecular functions reveals enrichment of structural peaks in each of three non-overlapping transcript segments (5’ UTR, 
constituents of the ribosome. Scheme based on an illustration obtained CDS and 3’ UTR) in HEK293 cells. e-g, The fraction of methylated genes 
from DAVID bioinformatics website of the KEGG human ribosome increases with gene expression levels in HepG2 (e), HEK293 (f) and 
pathway. Red stars indicate methylated genes in the pathway. Colouring common human peaks (see Methods) (g). 
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Extended Data Figure 3 | See next page for figure caption. 
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Extended Data Figure 3 | Metagene analyses of human m!A peaks. 

a, Metagene profiles demonstrating sequence coverage along a normalized 
gene transcript. Sequence reads of m'A immunoprecipitation and input in 
HeLa cells are indicated in blue and orange, respectively. b-d, Metagene 
profiles of m!A peak distribution in a non-normalized window centred on 
the AUG start codon (b), extending downstream from the transcription 
start site (TSS) (c) and centred on the stop codon (d), in the indicated 
human cell types. e, Metagene profiles comparing the distribution of m!A 
peaks (blue) to that of negative peaks (black) along a normalized transcript 
composed of three rescaled non-overlapping segments illustrated below, 
in HeLa cells. f, Table demonstrating m'A peak enrichment in growing 
windows centred on the AUG start codon in HeLa cells. Enrichment is 
calculated as number of peaks in the window divided by window size 
(nucleotides). g, Table summarizing the overlap between m°A and m'A 
peaks in HepG2, HEK293 and WT mESCs. m°A peaks are sourced from 
Dominissini et al.', Linder et al.°’ and Geula et al.””, respectively. m'A 
peaks are from the current study. h, i, Metagene profiles of m!A peak 
distribution in a non-normalized window centred on the AUG codon (h) 
and extending downstream from the TSS (i). Peaks are sorted by the exon 
containing the AUG codon and the length of the first exon, respectively. 


j, Metagene profiles of m'A peak distribution in a non-normalized 
window centered on the nearest splice site. Peaks are sorted by the 

exon containing the AUG codon in that gene. k, m'A-induced reverse 
transcription (RT) arrests produce typical mA peaks characterized by a 
central region of reduced coverage with a local minimum (m!A trough) 
Examples are shown. I, m'A-to-m°A rearrangement results in a reduced 
number of identified m'A troughs. Higher rates of rearrangement further 
reduce the number of identified m’A troughs (right panel). Example is 
shown (left panel). m, Metagene profile of m'A trough distribution along 
a normalized transcript composed of three rescaled non-overlapping 
segments illustrated below, in HepG2 cells. n, Metagene profile of mA 
trough distribution in a non-normalized window centred on the AUG 
start codon in HepG2 cells. 0, Metagene profile of m'A trough distribution 
along a normalized transcript in HepG2 cells. p, Metagene profile of 
m/A trough distribution in a non-normalized window centred on the 
nearest splice site in HepG2 cells. q, Pie chart presenting the fraction of 
m!A troughs in each of three non-overlapping transcript segments 

(5’ UTR, CDS and 3'UTR) in HepG2? cells. r, MEME motifs identified in 
100-nucleotide windows centred on m’A troughs that lie within the AUG 
start codon window (+150 nucleotides) in HepG2 cells. 
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Extended Data Figure 4 | Correlation between m’A sites and TISs in 
human. a, b, The mean number of alternative TISs in methylated (m!A) 
versus all other (unmethylated) genes in HeLa (a) and HEK293 (b) cells. 
Mean values + s.e.m. are shown. P values are indicated, Mann-Whitney 
Utest. c, The mean number of alternative TISs per gene as a function 

of the number of m!A peaks per gene in HeLa and HEK2393 cells. 

Mean values + s.e.m. are shown; r and P values are indicated, Pearson 
correlation; regression line is drawn. d, e, The percentage of genes with 


to canonical AUG) 


upstream (=5’ UTR) or downstream (=CDS) m!A sites out of all genes 
that have either downstream or upstream TISs, compared to the expected 
percentage in HeLa (d) and HEK293 (e) cells. P values are indicated, 

x? test. fh, Scatter plots showing the correlation between the locations 
(logs) of alternative TISs and m‘A peaks with respect to the canonical 
AUG start codon (0) in HeLa (f), HEK293 (g) and HepG2 (h) cells: left, 
upstream TISs (uTIS) and 5’ UTRs m!A peaks; right, downstream TISs 
(dTIS) and CDS m!A peaks. r and P values are indicated, t-test. 
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Extended Data Figure 5 | Structure and sequence features of m'A 

sites in human mRNA. a, m’A peaks have a significantly higher GC 
content compared to negative peaks in all three transcript segments: 5’ 
UTR, CDS and 3’ UTR. Box limits represent 25th percentile, median and 
75th percentile, whiskers represent 2.5 and 97.5 percentiles, and dots 
indicate outliers. P=1.5 x 107778, P= 8.2 x 10°?” and P=3.3 x 107771, 
respectively, t-test. b, Motifs identified in 400-nucleotide windows centred 
on the canonical AUG start codon in genes with m'A peaks in this window 
(upper table), or around m!A peaks located in the CDS, outside the AUG 
start codon window (lower table). c, Examples of adenosines around 

m!A peak middles with increased mismatch rates. Fold-enrichment 
values are the ratios of mismatch rates in untreated relative to rearranged 
samples. The top ten highest fold enrichment samples are shown. 


Position downstream of TSS (nts) 


d, e, GC content (d) and minimum free energy density (MFEden) (e) 

of 5’ UTRs of methylated (m'A), unmethylated (Non-m!A) and all 

genes. Box limits represent 25th percentile, median and 75th percentile, 
whiskers represent 2.5 and 97.5 percentiles. P values are indicted, t-test. 

f, Mean GC content (upper panel), PARS score (middle panel) and free 
energy (AG, lower panel) in a 300-nucleotide window centred on the 
start codon of commonly methylated genes relative to non-methylated 
genes (see Methods). Error bars represent s.e.m. g, The PARS scores of 
methylated compared to all other genes in HepG?2 cells in a 150-nucleotide 
window extending downstream from the TSS, calculated in 30-nucleotide 
sliding windows. Each plot represents data from an independent PARS 
experiment. Error bars represent s.d. 
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Extended Data Figure 6 | Features of the mouse m'A methylome 

(part 1). a, Detection of an m’A site at position 1135 of mouse 28S 

rRNA, in mouse liver m!A immunoprecipitation. A drop in sequence 

read coverage can be seen at the methylated position. b, Fold-enrichment 
(immunoprecipitation over input reads) identifies an m'A peak. c, High 
mismatch rate at the identified m!A 1135 in mouse 28S rRNA. d, Pie charts 
presenting the fraction of m'A peaks in each of three non-overlapping 
transcript segments (5’ UTR, CDS and 3’ UTR) in the indicated mouse cell 
types. e, Metagene profiles of m'A peak distribution in a non-normalized 
window centred on the AUG start codon in the indicated mouse cell 

types. f, Table showing m'A peak enrichment in growing windows centred 
on the AUG start codon in mouse liver. Enrichment is calculated as the 
number of peaks in the window divided by window size (nucleotides). 

g, h, Metagene profiles of m'A peak distribution in a non-normalized 


window extending downstream from the TSS (g) and centred on the 
stop codon (h) in the indicated mouse cell types. i, The mean number 
of alternative TISs in methylated (m!A) versus all other (unmethylated) 
genes in MEF cells. Mean values + s.e.m. are shown; P value is indicated, 
Mann-Whitney U test. j, The percentage of genes with upstream (=5’ UTR) 
and downstream (=CDS) m!A sites out of all genes that have either 
downstream or upstream alternative TISs, compared to the expected 
percentage in MEF cells. P value is indicated, x? test. k, 1, Scatter plots 
showing the correlation between the locations (log) of upstream TISs 
(uTIS) and 5’ UTR m!A peaks relative to the canonical AUG start codon 
(0) in mouse liver (k) and MEF (I) cells. r and P values are indicated, 
t-test. m, Metagene profiles of mouse m'A peak distribution in a 
non-normalized window centred on the AUG codon. Peaks are sorted 
by the exon containing the AUG codon in that gene. 
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Extended Data Figure 7 | Features of the mouse m'A methylome 

(part 2). a, Mouse m!A peaks have a significantly higher GC content 
compared to negative peaks. Box limits represent 25th percentile, median 
and 75th percentile, whiskers represent 2.5 and 97.5 percentiles, and 

dots indicate outliers. P= 4.4 x 107!”°, t-test. b, Motifs identified in 
400-nucleotide windows centred on the canonical AUG start codon in 
genes with mA peaks in this window in mouse liver. c, d, GC content (c) 
and MFEden (d) of 5’ UTRs of methylated (m'A), unmethylated (Non-m!A) 


o——3) 


i#—_t i 3’ 5’ = HHH HH + = 3’ 


and all genes. Box limits represent 25th percentile, median and 75th 
percentile, whiskers represent 2.5 and 97.5 percentiles. P values are 
indicted, t-test. e, f, A sliding window profile of mean GC content (e) and 
mean AG (f) in a 300-nucleotide window centred on the canonical AUG 
start codon in methylated (mA) genes compared to all other genes in 
mouse liver. P values are indicated, Kolmogorov—-Smirnov test and f-test. 
g, Representative plots of human-mouse orthologous genes with 
conserved m!A peaks. Plot format as in Fig. 2a. 
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Extended Data Figure 8 | The yeast m!A methylome. a, m'A-seq 
identifies the known mA sites (645 and 2142) in S. cerevisiae 25S rRNA. 
A drop in sequence read coverage (indicated by purple dots) occurs 
around the methylated positions (indicated by dashed lines). 

b, Representative plots of methylated transcripts in S. cerevisiae. 

Plot format as in Fig. 2a. c, The percentage of methylated genes that 
carry 1, 2 or 3 m!A peaks per gene in S. cerevisiae. Out of 843 m'A peaks 


(FC > 4, FDR < 5%) in 778 genes, most (88.6%) are methylated only once. 


Unlike in mammals, m!A is distributed across the coding transcriptome 
without an apparent preferred location. d, m'A-seq identified the known 
mA sites (670 and 2230) in S. pombe 25S rRNA. A drop in sequence 
read coverage (indicated by purple dots) occurs around the methylated 
positions (indicated by dashed lines). e, Representative plot of a 
methylated transcript in S. pombe. Plot format as in Fig. 2a. f, Pie charts 
presenting the fraction of m!A peaks in each of three non-overlapping 


transcript segments (5’ UTR, CDS and 3’ UTR) in S. pombe Sp1 strain 
under the indicated conditions. Under vegetative growth, we identified 
706 m!A peaks (FC > 4, FDR < 5%) in 619 gene transcripts, most of 
which (90.4%) distributed along the CDS. Four hours after transfer to a 
nitrogen-source deficient ‘sporulation’ medium, 157 out of the vegetative 
state m'A peaks were no longer detected, and 297 new peaks appeared. 
Importantly, transcripts that harbour differential peaks were adequately 
expressed (above the 1st quartile) in both conditions. g, The percentage 
of methylated genes that carry 1, 2, or 3+m'A peaks per gene in S. pombe 
Sp1 strain under the indicated conditions. h, Venn diagram representing 
differential and shared m'A peaks in S. pombe Sp1 strain under the 
indicated conditions. i, Representative plots of a differentially methylated 
transcript in S. pombe Sp1 strain under the indicated conditions. 

Yellow box, conserved peak; green box, differential peak. 
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untreated and glucose-starved (c) or heat shock-treated (d) HepG2 cells. 
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Extended Data Figure 10 | m'A around the start codon correlates (Mann-Whitney U test) and fold-changes (FC) of median RRS values 
with higher translation efficiency (TE). a, Cumulative distribution of (Start m'A genes/All the rest) are indicated. c, Genes methylated in 
log,(TE) in genes methylated in a 300-nucleotide window centred on different start codon window sizes have higher TE compared to all other 
the start codon compared to all other genes, in the indicated human and genes, in HeLa cells. When considering all m'A genes, including those 
mouse cell types. P values (t-test) and fold-changes (FC) of median TE methylated outside the start codon window, the effect is reduced. P values 
values (Start m'A genes/All the rest) are indicated. b, Genes methylated in (t-test) and fold-changes (FC) of median TE values (Start m!A genes/All 
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ribosomal drop-off in the 3’ UTR, are in line with TE scores. P values 
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Structural basis for activity regulation of 
MLL family methyltransferases 


Yanjing Li?, Jianming Han!?, Yuebin Zhang’, Fang Cao‘, Zhijun Liu!’, Shuai Li’, Jian Wu!?, Chunyi Hu!’, Yan Wang!?, 
Jin Shuai!?, Juan Chen!, Liaoran Cao’, Dangsheng Li°, Pan Shi’, Changlin Tian”*, Jian Zhang°, Yali Dou*, Guohui Li’, 


Yong Chen!” & Ming Lei}? 


The mixed lineage leukaemia (MLL) family of proteins (including MLL1-MLL4, SET1A and SET1B) specifically 
methylate histone 3 Lys4, and have pivotal roles in the transcriptional regulation of genes involved in haematopoiesis 
and development. The methyltransferase activity of MLL1, by itself severely compromised, is stimulated by the three 
conserved factors WDR5, RBBP5 and ASH2L, which are shared by all MLL family complexes. However, the molecular 
mechanism of how these factors regulate the activity of MLL proteins still remains poorly understood. Here we show 
that a minimized human RBBP5-ASH2L heterodimer is the structural unit that interacts with and activates all MLL 
family histone methyltransferases. Our structural, biochemical and computational analyses reveal a two-step activation 
mechanism of MLL family proteins. These findings provide unprecedented insights into the common theme and functional 
plasticity in complex assembly and activity regulation of MLL family methyltransferases, and also suggest a universal 


regulation mechanism for most histone methyltransferases. 


Methylation of histone H3 Lys4 (H3K4), which is predominantly 
associated with actively transcribed genes!~, is mainly mediated by 
MLL family histone lysine methyltransferases (HKMTs). Mammalian 
MLL family HKMTs contain six members (MLL1-MLL4, SET1A and 
SET1B)?-4, each of which has crucial yet non-redundant roles in cells*®. 
MLLI has been the most intensively studied because of its involve- 
ment by chromosomal translocations in a variety of acute lymphoid 
and myeloid leukaemias’. Recently, inactivating mutations in MLL3 
(also known as KMT2C) and MLL4 (KMT2D) have been identified in 
several types of human tumours and in Kabuki syndrome*””. 

In contrast to most SET [SU(VAR)3-9, E(Z) and TRX]-domain- 
containing methyltransferases, MLL1 protein alone exhibits poor 
HKMT activity!?*. The crystal structure of the MLL1 SET domain 
(MLLI¢p7) reveals an open conformation that is not efficient for the 
methyl transfer from the cofactor S-adenosyl-L-methionine (AdoMet) 
to the target lysine!°. The optimal HKMT activity of MLL1 requires 
additional factors, WDR5, RBBP5 and ASH2L, which are shared core 
components of all MLL complexes and also evolutionarily conserved 
from yeast to humans’*"®. Depletion of any of these components results 
in the global loss of H3K4 methylation to varying degrees'*!8. Despite 
the importance of WDRS5, RBBP5 and ASH2L, it is still unclear how 
these factors stimulate the HKMT activity of MLL proteins. In this 
work, our biochemical and structural analyses reveal how RBBP5- 
ASH2L binds and activates MLL family methyltransferases through 
a conserved mechanism. 


RBBP5-ASH2L binds and activates MLLs 

We first examined the effects of individual components (WDRS, 
RBBP5 and ASH2L) and their combinations on the HKMT activ- 
ities of MLL family methyltransferases. We selected the carboxy- 
terminal conserved regions of MLL proteins containing both the WIN 


(WDR35-interaction) motif and the SET domain in activity assays'* 
(Fig. 1a). For simplicity, hereafter we use ‘MLE to represent the MLL 
WIN-SET fragment, and ‘MLLsegr’ to represent the MLL SET domain 
unless stated otherwise. Consistent with previous observations!*!>!9, 
activity assays showed that the RBBP5-ASH2L heterodimer substan- 
tially upregulated the HKMT activity of MLL1, and this activity was 
further stimulated by the addition of WDRS (Fig. 1b and Extended Data 
Fig. la-c). By contrast, MLL2-MLL4, SET1A and SET1B can be fully 
activated by just RBBP5-ASH2L, and WDRS was dispensable for activ- 
ity regulation (Fig. 1b and Extended Data Fig. la—c). The stimulatory 
effect of RBBP5-ASH2L on MLL HKMT activities indicated a possi- 
ble direct interaction between RBBP5-ASH2L and MLL proteins””-””. 
Indeed, a glutathione S-transferase (GST) pull-down assay clearly 
showed that all MLL proteins directly interact with RBBP5-ASH2L 
(Fig. 1c and Extended Data Fig. 2a). Among them, MLL2-MLL4 could 
be efficiently pulled down by GST-ASH2L-RBBP5, whereas SETIA 
and SET1B maintained a medium level of interaction with RBBP5- 
ASH2L (Fig. 1c and Extended Data Fig. 2a). By contrast, MLL1 only 
exhibited a very weak interaction with RBBP5-ASH2L under low-salt 
buffer conditions (Fig. 1c and Extended Data Fig. 2a). Fluorescence 
polarization analysis also revealed that MLL proteins interact with 
RBBP5-ASH2L with very different binding affinities ranging from 
~100nM (for MLL3) to more than 100,1M (for MLL1) (Extended 
Data Fig. 2b). Formation of the RBBP5-ASH2L heterodimer is a 
prerequisite for MLL binding, as neither RBBP5 nor ASH2L alone 
can stably associate with MLL proteins (Extended Data Fig. 2c, d). 
Notably, MLL proteins can also stabilize the RBBP5-ASH2L interac- 
tion when high-salt buffer was used in the pull-down assay (Extended 
Data Fig. 2a), consistent with the observation that the RBBP5-ASH2L 
interaction is highly sensitive to ionic strength (Extended Data 
Fig. 2e). 
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Figure 1 | RBBP5-ASH2L interacts and activates MLL proteins. 

a, Domain organization of human ASH2L, RBBP5, WDR5 and MLL 
proteins. Only the C-terminal domain of MLL is shown. DBM, DPY30 
binding motif; PHD-WH, plant homeodomain-winged helix. Shaded 
areas denote the interacting domains among these proteins. b, The 
normalized HKMT activities determined by a *H-methyl-incorporation 
assay. Mean + s.d. (n = 3) are shown. c, GST pull-down assay shows MLL 
proteins directly interact with the RBBP5-ASH2Lspry heterodimer. 

d, The normalized HKMT assays revealed that an activation segment of 
RBBP5 (residues 330-344) is crucial for the stimulation of MLL3 activity. 
FL, full-length. Mean + s.d. (n = 3) are shown. 
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Because MLL1 only maintained a weak direct interaction with 
RBBP5-ASH2L, we proposed that full activation of MLL1 by RBBP5- 
ASH2L requires the bridging molecule WDRS that can interact with 
both MLL1 and RBBP5-ASH2L simultaneously. Consistent with this 
idea, the stimulatory effect of WDR5 on MLL1 HKMT activity is min- 
imized when the protein concentration was increased in the assay 
(Extended Data Fig. 2f). Furthermore, the fusion of RBBP5 and MLL1 
together achieved a robust HKMT activity that cannot be further stim- 
ulated by the addition of WDRS5 (Extended Data Fig. 2g), suggesting 
that WDRS per se is not directly involved in the MLL HKMT enzymatic 
reaction. Collectively, we conclude that RBBP5-ASH2L is the major 
functional unit that binds and activates MLL proteins. Conversely, 
WDERS may have an indirect role in promoting HKMT activity by act- 
ing as a bridging molecule to facilitate the formation of MLL complexes 
under certain assay conditions, and this may explain the apparent dis- 
crepancy in reports about the role of WDRS in the activity regulation 
of MLL complexes!?””. 


Complex structure of MLL3-RBBP5-ASH2L 

To determine the structural basis of how RBBP5—ASH2L activates 
MLL proteins, we first dissected the interactions among RBBP5, 
ASH2L and the SET domains of MLL proteins. Consistent with pre- 
vious studies”*4, the ASH2L C-terminal SPRY (splA and ryanodine 
receptor) domain is sufficient to form a heterodimer with RBBP5 to 
stimulate the HKMT activity of MLL proteins (Fig. 1d, compare lanes 
1 and 5). Three adjacent short motifs of RBBP5 were identified for 
the stimulation of MLL HKMT activity (residues 330-344, activation 
segment, AS), the interaction with ASH2L (residues 344-363, ASH2L- 
binding motif, ABM), and the association with WDRS (residues 
369-381, WDR5-binding motif, WBM)**”s (Fig. 1a). A preformed 
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RBBP5as-asm—-ASH2Lspry complex can stimulate MLL3ser HKMT 
activity to levels of ~70% of full-length RBBP5-ASH2L (Fig. 1d, com- 
pare lanes 2, 4 and 6), indicating that this minimized RBBP5,s-apm- 
ASH2Lsppy heterodimer is essential for the stimulation of MLL3sp7 
activity, and that other regions of RBBP5 might have a minor role in 
this process. We determined the crystal structure of this minimized 
ternary complex composed of MLL3¢p7, RBBP5as-apm and ASH2Lepry 
(hereafter referred to as M3RA) in the presence of S-adenosyl- 
L-homocystein (AdoHcy) and a substrate peptide (H3 residues 1-9) 
(Fig. 2a, Extended Data Table 1 and Extended Data Fig. 3a). Notably, 
we crystallized the M3RA complex both with and without the H3 
peptide in one asymmetric unit (Extended Data Fig. 3b). 

In the M3RA complex, RBBP5,4s-apm adopts an extended conforma- 
tion that consists sequentially of two 8-strands (activation segment) 
and a rigid coil (ABM), which respectively mediate the interactions 
with MLL3gpy7 and ASH2Lgpry (Fig. 2a). The overall fold of MLL3gp7 
is similar to other SET-domain proteins, and shares the conserved 
features of N- and C-terminal regions (SET-N and SET-C), an inser- 
tion region (SET-I) and post-SET motifs!’ (Fig. 2b). The active 
site residues of MLL3¢p7, the conformation of the target lysine and an 
invariant water molecule, are almost identical to those of the active 
site of DIM-5 (ref. 28), suggesting a catalytically active configuration 
of MLL3spr (Fig. 2c). The ‘U’-shaped cofactor product AdoHcy binds 
into a well-defined surface pocket on MLL3¢p7 through an exten- 
sive network of highly conserved interactions as observed in other 
SET-domain structures (Fig. 2d and Extended Data Fig. 3c, d). The 
H3 substrate peptide sits in an opposite groove on the surface of 
MLL3¢p7, and an intricate network of hydrogen bonds stabilizes the 
binding (Fig. 2e and Extended Data Fig. 3e). The unique geometry of 
the H3-binding groove specifically recognizes Thr3!8 and Arg2", 
defining the substrate specificity of MLL3spy (Extended Data Fig. 3f, 
g). Since all the H3-peptide-binding residues in MLL3sgr are highly 
conserved in other MLL proteins (Extended Data Fig. 4), we conclude 
that all MLL proteins achieve the substrate specificity towards H3K4 
through the same recognition mechanism as observed in the M3RA 
complex. 


Interfaces in the M3RA complex 

The structure of the M3RA complex reveals extensive interactions 
among ASH2Lspry, RBBP5as- apm and MLL3¢p7. ASH2Lspry rec- 
ognizes RBBP5apm through extensive salt-bridge and hydrogen- 
bonding interactions; the C-terminal portion of RBBP5,gm adopts 
a coiled conformation sitting on two arginine residues (Arg343 and 
Arg367) at the centre of the basic pocket of ASH2Lgpry (Fig. 3a). 
Mutations of ASH2L Arg343 and its interacting residues in RBBP5 4pm 
(Glu349 and Asp353) completely abrogated the RBBP5,agm-— 
ASH2Le¢pry interaction (Extended Data Fig. 5a) and impaired the 
HKMT activity of the MLL3 complex (Extended Data Fig. 5b). The 
primary feature of the RBBP5,s—MLL3sgr interaction is the inter- 
molecular }-sheet interactions involving two strands of the L-shaped 
RBBP5as paring with 34 and an induced strand 87 immediately before 
helix aC of MLL3spr (Fig. 3b). Mutations of residues on this L-shaped 
motif partially decreased the HKMT activity of the MLL3 complex 
(Extended Data Fig. 5c). In addition to these binary contacts, the side 
chain of the conserved Arg4806 of MLL3sgr sticks outside towards 
an acidic pocket formed by both RBBP5 gm and ASH2Lgpry, forming 
five salt-bridge and hydrogen-bonding interactions with Glu347*?"", 
Tyr313457L and GIn35445"7L (Fig. 3c). This extensive electrostatic 
network functions as an anchor point to fix the relative position of 
MLL3gpy7 to ASH2Lgpry, and is crucial for assembly of the MLL3- 
RBBP5-ASH2L complex. 

Because the RBBP5as.4nm—ASH2Legppry-interacting residues are 
highly conserved in MLL-family proteins (Extended Data Fig. 4), 
we proposed that all MLLsgy domains including MLL1sgr should 
interact with RBBP5-ASH2L through the same molecular surface 
as observed in the M3RA complex. In support of this idea, alanine 
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structure of MLL3sp7-RBBP5,5-apm-ASH2Lsgpry in complex with cofactor 
product AdoHcy and the H3 peptide. b, MLL3<er shares the conserved 
features of SET-N, SET-I, SET-C and post-SET motifs. c, Comparison of 
the active centre of MLL3 and DIM-5 (PDB accession 1PEG) complex 
structures. d, The AdoHcy binding pocket in MLL3sgr. e, The substrate H3 
binding channel. Hydrogen bonds are indicated by dashed magenta lines; 
purple sphere denotes a water molecule. 


substitutions of the conserved arginine residues in all MLL proteins, 
which do not affect the overall fold of MLL proteins (Extended Data 
Fig. 5d), abolished the interaction between MLLszr and RBBP5- 
ASH2L (Fig. 3d), and substantially decreased the HKMT activities 
of all MLL complexes (Fig. 3e). Accordingly, mutations of the argi- 
nine-interacting residues on RBBP5 and ASH2L (RBBP5 Glu347 
and ASH2L G1n354) also weakened the association of MLLspy with 
RBBP5-ASH2L, and reduced the HKMT activities of MLL complexes 
(Extended Data Fig. 5e, f). Together, our data confirmed that the elec- 
trostatic network observed at the MLL3sz7-RBBP5—ASH2L interface 
is essential for the interaction between RBBP5-ASH2L and all MLL 
proteins. Interestingly, an inactivating mutation of the same key argi- 
nine residue in MLL4 (Arg5432Trp) that was identified in patients 
with non-Hodgkin lymphoma’ also disrupted the interaction between 
MLL4 and RBBP5-ASH2L and abolished the HKMT activity (Fig. 3d, 
e), indicating that loss of a stable MLL4-RBBP5-ASH2L association 
leads to lymphomagenesis. 


Difference between MLL and other MLL proteins 
The structure of the M3RA complex revealed that subtle sequence dif- 
ferences in the RBBP5-ASH2L-binding region (residues 4804-4814 in 
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MLL3) are probably responsible for the ability of RBBP5-ASH2L to 
distinguish MLL1 from other MLL proteins (Fig. 4a). Most notably, the 
side chain of Val4809 in the SET-I motif of MLL3¢pr fits snugly in a 
shallow pocket formed by both RBBP5 and MLL3gpr (Fig. 4b), which 
can also accommodate the corresponding residues of MLL2, MLL4, 
SET1A and SET1B at the equivalent positions, but not for the bulky 
residue Gln3867 of MLLI1 (Fig. 4a, b). In addition, the side-chain 
methyl group of MLL3sp7 Thr4803 is surrounded by three hydrophobic 
resides of RBBP5as (Leu339, Val343 and Tyr345) (Fig. 4c). By contrast, 
a large hydrophilic residue Asn3861 at this position in MLLI is incom- 
patible with RBBP5 binding (Fig. 4c). Thus, we proposed that two 
residues (Asn3861 and Gln3867) in MLL1 weaken the otherwise 
stable interaction between RBBP5-ASH2L and MLLI. Indeed, both 
MLL1-to-MLL2 (Asn3861Ile/GIn3867Leu) and MLL1-to-MLL3 
(Asn3861Thr/G1n3867 Val) double mutants of MLL1 re-gained stable 
interactions with RBBP5-ASH2L (Fig. 4d), and WDRS had no further 
stimulatory effect on the HKMT activities of these mutants (Fig. 4e). 
Therefore, mutations of these two residues restore the strong RBBP5- 
ASH2L binding ability of MLL1 and thus bypass the requirement of 
WDR3 as the bridging molecule for the optimal HKMT activity of the 
MLL1 complex. This idea is further supported by the crystal structure 
of the MLL1538611/25867L_ RBBP5 as. apm-ASH2L pry complex 
(hereafter referred to as M1MRA, in which ‘M’ denotes mutant) 
(Fig. 4f and Extended Data Table 2). The structure of M1MRA highly 
resembles that of M3RA, with an identical interface as the one between 
MLL3spr and RBBP5-ASH2L (Fig. 4g), strongly indicating that the 
RBBP5-ASH2L heterodimer interacts with and activates all MLL pro- 
teins through a conserved mechanism. Notably, the equivalent residues 
of MLL1 Asn3861 in SET1A (Gln1600) and SET1B (Gln1816) also have 
large hydrophilic side chains and therefore are not optimal for RBBP5- 
ASH2L binding (Fig. 4a). This is consistent with the medium levels 
of interaction of SET1A and SET1B with RBBP5-ASH2L observed in 
the pull-down and fluorescence polarization assays (Extended Data 
Fig. 2a, b). 


Activation mechanism of MLL complexes 

Next we asked why MLL proteins by themselves are catalytically inac- 
tive, and how RBBP5—ASH2L stimulates their HKMT activities. One 
prevailing model suggests that the SET domain of MLL adopts an open 
conformation, and the interaction with regulatory factors induces MLL 
SET domain into a closed conformation". To test this model, we crys- 
tallized apo MLL3gp7 and determined its structure in complex with 
AdoHcy (Extended Data Fig. 6a). Surprisingly, the apo structure of 
MLL3¢pr was almost indistinguishable from the active conformation 
of MLL3sgr in the M3RA complex (Fig. 5a). In addition, we also 
determined the crystal structure of MLLI1¢., (MLLIir 611/Q3867L) the 
SET-I motif of which exhibits an even more closed conformation than 
that in the MIMRA complex (Fig. 5b and Extended Data Fig. 6b). 
Nevertheless, our data clearly showed that both MLL3 and MLLIM by 
themselves are catalytically inactive (Fig. 4e and Extended Data Fig. 1). 
This apparent discrepancy between the low enzymatic activity and the 
closed conformation of MLLI fer or MLL3¢pr led us to propose that, 
in the absence of RBBP5-ASH2L, MLL ser might be highly dynamic, 
and the configuration of MLL SET-I motif captured in the crystal 
structure is a snapshot of a spectrum of conformations of a mobile 
motif. In support of this idea, normal mode analysis revealed a highly 
dynamic motion of the SET-I motif in apo MLLI$-; and MLL3se1, 
which is substantially suppressed upon the association with RBBP5- 
ASH2L (Supplementary Videos 1-5). To test this model experimentally, 
we use !°F-NMR (fluorine-19 nuclear magnetic resonance) to probe 
the structural dynamics of MLL3spr in solution. The 19k_NMR spec- 
trum of Phe4827, a key residue at the substrate-binding site in the SET-I 
motif, displayed two peaks at different chemical shifts, defining at least 
two different conformations or states with dynamic exchanges (Fig. 5c). 
With titration of RBBP5-ASH2L, the '°F-NMR spectrum showed 
prominent changes with conformational equilibrium towards a single 
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Figure 3 | Interfaces among MLL3sp7, RBBP5,s-apm and ASH2L pry. 

a, Detailed view of the ASH2Lspry-RBBP5 py interface. b, The interface 
between MLL3gpr and RBBP5as. c, MLL3 Arg4806 forms an extensive 
salt-bridge and hydrogen-bonding network with ASH2L and RBBP5. d, 
Mutations of the conserved arginine in MLL proteins disrupt interactions 


active state, indicating that RBBP5-ASH2L reduced the flexibility of 
SET-I to lock it in an active state (Fig. 5c). By contrast, Tyr4762 that is 
located in the SET-N motif exhibited no peak shift upon the addition 
of RBBP5-ASH2L (Fig. 5c). 

To provide further insight into this dynamic process, we carried out 
molecular dynamics simulation to investigate how RBBP5-ASH2L 
affects the structures of MLL3spy and MLLIee Results showed that 
RBBP5-ASH2L reduces the root mean square fluctuation of helix aB 
and strand 87 of MLL3spr substantially (Fig. 5d). This coincides with 
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with RBBP5-ASH2L, as shown by the GST pull-down assay in 300 mM 
NaCl buffer. e, Arginine mutations impair the HKMT activities of MLL 
family proteins. Activities of all complexes are normalized to the activity 
of wild-type MLLI-WDR5-RBBP5-ASH2L, and shown as mean + s.d. 
(n=3). 


our observation that the most variable element in apo MLL gpr is the 
oB helix, illustrated by the superimposition of four apo MLL grr struc- 
tures (Extended Data Fig. 6c). Furthermore, a flexible loop in apo 
MLL3¢pr (Lec) is induced to form strand 87 by pairing with strand 31 
of RBBP5as (Fig. 5e). Other than the structural variation of individual 
residues, molecular dynamics simulation also clearly showed that the 
cross-correlation within the SET-I motif was greatly enhanced upon 
RBBP5-ASH2L association (Fig. 5fand Extended Data Fig. 6d-f). The 
reduced flexibility of the SET-I motif may help cofactor binding and 


Figure 4 | Difference between MLL] and other 
MLL proteins. a, Sequence alignment of the 
RBBP5-ASH2L binding fragments from MLL 
family proteins. Two key residues that explain 
the RBBP5-ASH2L binding affinity difference 
between MLL] and other MLL proteins are 
indicated by A and B sites. d, Drosophila; 

h, human. RA denotes RBBP5-ASH2L. b, The 
MLL3-RBBP5 interface around MLL3 Val4809. 
MLL1 G1n3867 (grey) cannot fit into this pocket. 
c, The MLL3-RBBP5 interface around MLL3 
Thr4803, which is not compatible with MLL1 
Asn3861 (grey). d, GST pull-down assay for the 
interactions of RBBP5-ASH2L with MLL spy and 
its mutants. e, The normalized HKMT activities 
of MLL1™7 and MLL1N28°1"/5867L in the presence 
of full-length RBBP5-ASH2L and WDR5- 
RBBP5-ASH2L. Mean +s.d. (n= 3) are shown. 
f, The overall structure of the MIMRA complex. 
g, Superposition of the structures of MIMRA 
and M3RA shows conserved interfaces between 
MLLsgpr and RBBP5-ASH2L. 
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substrate recognition. Indeed, isothermal titration calorimetry (ITC) 
analysis showed that the binding affinities of cofactor to MLL3 spr and 
MLLI{pr are markedly increased in the presence of RBBP5-ASH2L 
(Extended Data Fig. 7a, b). Furthermore, the association with RBBP5- 
ASH2L also facilitates MLL3 binding with the H3 substrate peptide 
(Extended Data Fig. 7c). Notably, Phe336 at the beginning of 61 in 
RBBP5as stacks together with the side chains of MLL3sp7/ MLLI1¢y 
Arg4845/Arg3903, Tyr4846/Phe3904 and Tyr4825/Tyr3883, and the 
latter makes a direct hydrogen-bonding interaction with AdoHcy 
(Fig. 5e). Molecular dynamics simulation revealed an obvious stabiliz- 
ing effect of RBBP5-ASH2L on this network of interactions, which 
could explain the enhanced cofactor-binding ability for the M3RA 
complex (Extended Data Fig. 7d). Quantum mechanics/molecular 
mechanics (QM/MM) investigations further indicated that the pres- 
ence of RBBP5-ASH2L facilitated the methyl transfer process from the 
cofactor AdoMet to the target lysine by lowering the energy barrier 
(Extended Data Fig. 7e). Taken together, we conclude that the RBBP5- 
ASH2L-induced conformational constraints on the SET-I motif help 
to stabilize MLLepy in a conformation competent for cofactor binding 
and substrate recognition. 

Structural comparison of the M3RA complex structures with and 
without the H3 peptide revealed a role of the substrate peptide in 
further stabilizing the active conformation of MLL3gp7, which has 
been observed in other SET-domain methyltranferases**. After H3 
binding, a marked local structural rearrangement occurs to loop Lys 
between helix aB and strand ($5 in the SET-I motif, leading to the 
completion of a narrow hydrophobic channel that orients the H3 
Lys4 side chain for catalysis (Fig. 5g). Remarkably, the side chain of 
MLL3 Val4824 shifts ~4.1 A and rotates ~50° relative to its position 
in the H3-peptide-free structure, enclosing the target lysine access 
channel (Extended Data Fig. 7f). Collectively, our studies suggest 
a novel two-step mechanism for MLL spr activation: interaction 
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Figure 5 | Activation mechanism of MLL 
proteins by RBBP5-ASH2L. a, Structural 
comparison of the apo MLL3ggy and MLL3ger 

in the M3RA complex. The structures are 
superimposed according to AdoHcy. b, Structural 
comparison of the apo MLLI1M, and MLLI1X., 

in the M1MRA complex. c, One-dimensional !°F- 
NMR measurements of MLL3spr with substitution 
of F4827tfmF (top) or Y4762tfmF (bottom) in the 
absence or presence of RBBP5-ASH2L. The 
locations of these two residues on MLL gg are 
shown. tfmF, L-4-trifluoromethylphenylalanine. 
d, Root mean square fluctuation (RMSF) of the 
SET domains in apo MLL3sgr (black line) and in 
the M3RA complex (red line). e, RBBP5 Phe336 
together with MLL3 Arg4845, Tyr4846 and 
Tyr4825 maintain a configuration that favours 
cofactor binding. f, The most highly correlated 
residues (correlation coefficients greater than 
0.55) of SET-I in molecular dynamics simulation 
are indicated by red lines. g, Structural 
superimposition of the M3RA and M3RA-H3 
complexes by the SET-I motifs highlights the local 
rearrangement of loop Lgs. h, A working model for 
the activation of MLL family methyltransferases. 
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with the RBBP5-ASH2L heterodimer reduces the inherent flex- 
ibility of MLLggy and favours formation of a catalytically compe- 
tent conformation; and then H3 substrate binding induces a local 
conformational change in the SET-I motif of MLLspr to achieve the 
fully active configuration that facilitates the methyl transfer process 
(Fig. 5h). 


Implications for other methyltransferases 

Structural comparison of the M1MRA and M3RA complexes with a 
large group of SET domain proteins reveals a striking similarity with 
other intrinsic active methyltransferases. In all SUV39- and SET2- 
family proteins, a short fragment amino-terminal to the pre-SET 
region (referred to as the activation segment) interacts with the SET-I 
motif in the same manner as RBBP5qs binding to MLL3gpy (Extended 
Data Fig. 8a, b). Deletion of this activation segment from DIM-5 did 
not affect the overall fold of the protein but completely abrogated the 
HKMT activity of DIM-5, underscoring the importance of this seg- 
ment in DIM-5 activity regulation (Extended Data Fig. 8c, d). Such 
an activation segment is also found in the EZH2 complex structure’, 
further supporting a conserved activation mechanism for a subset of 
SET-domain-containing methyltransferases. 

In summary, the present structural, biochemical and computational 
analyses provide new insights into the assembly and regulation mech- 
anism of MLL family complexes. Our results suggest that a minimized 
RBBP5-ASH2L heterodimer is the structural unit to interact with and 
activate all MLL family histone methyltransferases. By contrast, WDR5 
is not directly involved in the enzymatic stimulation of MLL complexes. 
WDRS may serve as a recruitment module or crosstalk mediator to 
regulate H3K4 methylation in vivo? **. 

Online Content Methods, along with any additional Extended Data display items and 


Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Protein expression and purification. The SET domains of MLL family proteins 
(with or without the WIN motif), RBBP5, ASH2L, WDRS5 and their truncations 
or mutants were purified as described before’. Escherichia coli Rosetta cells bear- 
ing expression plasmids were induced for 16 h with 0.1 mM IPTG at 18°C, and the 
cells were collected by centrifugation. For MLL expression, 101M ZnSO, was 
included in the media. The cell pellets were resuspended in lysis buffer (50 mM 
Tris-HCl, pH 8.0, 400 mM NaCl, 10% glycerol, 2mM 2-mercaptoethanol, and 
home-made protease inhibitor cocktail). The cells were broken by sonication and 
cleared by ultracentrifugation at 100,000g for 30 min. The proteins were purified 
using Ni-NTA agarose beads (Qiagen) for His-tagged proteins or Glutathione 
Sepharose 4B beads (GE) for GST-tagged proteins, followed by enzyme digestion 
to remove the tags and gel-filtration chromatography. MLLsp7, ASH2Lspry, WDRS 
and RBBP5 fragments were separated on Hiload Superdex 75, while full-length 
proteins of ASH2L and RBBP5 were separated on Hiload Superdex 200. The buffer 
for gel-filtration chromatography contains 25 mM Tris-HCl, pH 8.0, 150mM NaCl 
except for MLL ger (which is in buffer 50 mM Tris-HCl, 300 mM NaCl and 10% 
glycerol, pH 8.0). The purified proteins were concentrated to 10-20mgml' and 
store at —80°C. RBBP5 peptides were separated on Hiload Superdex 75 after tag 
digestion in buffer (100 mM NH4HCOs) and the peptide-containing fractions were 
lyophilized. The MLLsp7—-RBBP5as-apm-ASH2Lspry complex was obtained by 
step-wise gel-filtration chromatography; binary complex of RBBP5as-anm- 
ASH2Lspry was first purified, and then mixed with MLL] ¢p7 or MLL3¢p7, followed 
by separation on Hiload Superdex 75. Mutations were introduced by PCR-based 
site-directed mutagenesis, and mutated proteins were purified using the same 
protocol as described above. 

Crystallization, data collection and structural determination. For structural 
studies, more than 50 different combinations of numerous RBBP5 fragments, 
ASH2Lgpry constructs, and SET domains from different MLL proteins were tested 
for crystallization. MLL3¢e7 was crystallized in 100 mM Tris-HCl, pH 8.5, 3M 
NaCl at 4°C in the presence of 1 mM AdoHcy. Zinc single-wavelength anoma- 
lous dispersion (SAD) and native data sets of MLL3spry were collected at SSRF 
(Shanghai Synchrotron Radiation Facility in China) beamline BL17U at wave- 
lengths of 1.2818 Aand 0.9793 A, respectively. Data were indexed, integrated, and 
scaled using program HKL2000 (ref. 35). Crystals belong to space group P4,32 and 
contain one MLL3¢pr per asymmetric unit. Zinc SAD phase determination, den- 
sity modification and automatic model building were carried out using SHARP**. 
The initial model was further refined using the native data set diffracted at 2.8 A. 
After several rounds of refinement in PHENIX package*” with manual rebuilding 
in COOT*®, the final model has good stereochemistry with an R value of 18.0% 
and an Réee of 22.9%. 

The MLL3¢p7-ASH2Lgpry-RBBP5 45. 4pm complex was crystallized in 100 mM 
sodium cacodylate, pH 6.5, 10% PEG-3350, 0.1 M MgCl at 4°C in the presence 
of 1mM AdoHcy. The co-crystal with H3 peptide (ARTKQTARK) was obtained 
by soaking crystals in reservoir solution with 1mM H3 peptide for 2h before 
collection. A data set of 2.4A resolution was collected at Advanced Photon Source 
beamline 21ID-D at wavelength of 1.1272 A. The crystal belongs to space group 
P2;2)2 with cell dimension a = 80.342 A, b= 236.076 A, c= 44.416 A. The complex 
structure was solved by molecular replacement using PHASER® with ASH2Lspry 
structure (PDB accession 3TOJ) and the MLL3 structure SET-N, SET-I, and SET-C 
motifs as search models. There are two MLL3gp;-ASH2Lsppy-RBBP5,s5.apm Com- 
plexes in one asymmetrical unit, and we can only observed H3 peptide in the 
density map of one complex. The model was further refined using PHENIX with 
manually rebuilding in COOT. 

MLL Nate1l Qose7t crystals were grown by sitting drop vapour diffusion 
method at 4°C in a solution containing 35% (v/v) tacsimate, pH 7.0, in the pres- 
ence of 2mM AdoHcy, and the crystals were cryo-protected in the same reservoir 
solution supplemented with 20% glycerol. Data sets were screened and collected 
at SSRF BL18U and BL19U. The structures were solved by molecular replacement 
(starting model PDB accession 2W5Y). The MLL1N38611/ 38671 —ASH2Lspry- 
RBBP5as-apm complex was crystallized at 200 mM NaCl, 20% PEG3350 in the 
presence of 2mM AdoHcy. A data set of 1.9 A resolution was collected at SSRF 
BL17U at wavelength of 0.9792 A. The structures were solved by molecular 
replacement and further refined with PHENIX. All structure figures were gener- 
ated using PyMOL (The PyMOL Molecular Graphics System, version 1.4.1 
Schrédinger, LLC.). 

Histone methyltransferase assay. In vitro methyltransferase assays were per- 
formed using an H3 peptide as the substrate. Two assay systems were used. The 
first one is the *H-methyl-incorporation assay that measured the incorporation 
of 3H from [3H]AdoMet (S-adenosyl-L-[methyl-*H]-methionine) into the H3 
peptide (9 mer: ARTKQTARK). Reactions were carried out at 22°C for 1h in the 
buffer containing 20 mM HEPES, pH 7.8, 5% glycerol, 5 mM dithiothreitol (DTT), 
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0.5mM EDTA, 1 Ci [7H] AdoMet as previously described!’. Unmodified H3 K4 
peptides (0.25 mM) and 1,1M of WDRS5, RBBP5, ASH2L and MLL proteins were 
used, except for SET1A (511M). For all activity assays, full-length WDR5, RBBP5 
and ASH2L were used unless stated otherwise. MLL constructs containing both 
the WIN motif and SET domain were used. Each assay was performed in triplicate, 
and the mean + s.d. was reported. The second assay system is to monitor the meth- 
ylation kinetics of the H3 peptide substrate using MALDI-TOF (matrix-assisted 
laser desorption ionization-time-of-flight) mass spectrometry. 

Mass spectrometry analysis of the methylation process. Methylation reactions 
were carried out in 20mM HEPES, pH7.8, 10mM NaCl, 5mM DTT, 250M 
AdoMet, 10|1.M histone peptide (ARTKQTARKS) and 11M MLL complexes 
at 22°C. The reaction was quenched at different time points by addition of trif- 
luoroacetate (TFA) to 0.5%. Reaction mixture was diluted in 10 mgml~! CHCA 
(a-cyano-4-hydroxycinnamic acid) matrix and was spotted onto sample plate and 
air-dried. The molecular mass was measure by MALDI-TOF (AB SCIEX TOF/ 
TOF 5800) operated in reflectron mode. Final spectra were the average of 200 shots 
per position at 200 different positions chosen randomly on each spot. To estimate 
the pseudo-first-order rate constants, we fit the decrease in the relative intensity of 
the unmodified peptide over time using a model for a single irreversible reaction 
[Lys4];= [Lys4]oe~*, in which [Lys4]p is the initial concentration of the unmodified 
peptide, [Lys4], represents the concentrations of the unmodified peptide at time t 
and k is the pseudo-first-rate constant. 

GST pull-down assays. GST-fusion proteins and interacting partners were incu- 
bated with glutathione Sepharose 4B beads for 2h at 4°C in 10011 buffer (50 mM 
Tris-HCl, 300 mM NaCl and 2mM DTT, pH8.0). After extensive wash with the 
same buffer, the bound proteins were eluted in elution buffer (50 mM Tris-HCl, 
pH8.0, 300 mM NaCl and 15 mM reduced glutathione). The input samples and 
eluted samples were visualized on 12% SDS-PAGE by Coomassie blue staining. 
Initially, different pull-down buffers were tested and it was found that the interac- 
tion between ASH2L and RBBP5 could be disrupted by high ionic strength used in 
the pull-down assay, whereas the formation of the MLL-RBBP5-ASH2L trimeric 
complex is relatively insensitive to salt concentration. Thus, in most pull-down 
assays, buffer with 300 mM NaCl was chosen to assure undisrupted RBBP5-ASH2L 
interaction and also keep protein stable through pull-down experiments unless 
stated otherwise. 

Isothermal titration calorimetry. The equilibrium dissociation constants of cofac- 
tor binding to MLLsp7 or MLLsp7-RBBP5as-4sm-—ASH2Lspry were determined by 
an ITC200 calorimeter (GE healthcare). The binding of proteins (20-200 1M) and 
cofactor AdoMet (0.5-2mM) were measure in the 25 mM Tris-HCl, pH 8.0, 
300mM NaCl at 20°C. ITC data were analysed and fit using Origin 7 (OriginLab) 
using one-site model. Owing to instability of apo MLL protein during ITC experi- 
ments, curve fitting errors for apo MLL titration are relatively large, so the binding 
parameters of apo MLL proteins are rough estimations. 

Fluorescence polarization assay. Different MLL proteins were diluted in 20 mM 
HEPES, pH7.8, 150mM NaCl, 10% glycerol, 0.5 mg ml"! BSA to a serial of con- 
centrations from 25nM to 50,.M. The FAM-labelled RBBP5 peptide (residues 
330-363) was mixed with ASH2L sppy and used at a final concentration of 100nM. 
The final volume was brought up to 10011 with dilution buffer (20 mM HEPES, 
pH7.8, 150mM NaCl, 10% glycerol and 0.5 mg ml“! BSA) and incubated in dark 
for 30 min. The fluorescence polarization values were measured using Synergy 
Neo Multi-Mode Reader (Bio-Tek) at 27°C. Excitation wavelength was 485 nm 
and emission was detected at 528 nm. Fluorescence was quantitated with GEN 5 
software and date was analysed with Prism 6. For MLL1, SET1A and SET1B, the 
binding is not saturated even at the highest protein concentration, so the calculated 
Ky should be an estimated lower limit of Kg value. 

19F_NMR spectra measurements. Expression of '’F-labelled proteins was achieved 
by an established protocol by incorporation of non-natural amino acid tfmF 
(u-4-trifluoromethylphenylalanine) into specific sites using genetic code TAG”. 
The !°F-labelled MLL3er proteins were purified using the same protocol as for 
wild-type MLL3sz7 protein. The !?F-NMR spectra were obtained on a Bruker DMX 
Avance-500 MHz spectrometer equipped with a 5 mM PABBO room temperature 
probe. The spectra of 0.3mM MLL3 F4827tfmF or 0.35 mM MLL3cp7 Y4762tfmF 
with or without RBBP5-ASH2L were collected at 293 K. The observation channel 
was tuned to }°F (470.54 MHz), with 512 free induction decay accumulations in 
every 3-s recycling delay. Each one-dimensional !°F-spectrum was acquired with 
a standard pulse program with a 90° pulse width of 16.75 1s and power at 35.9 W. 
19°Fchemical shifts were referenced to an external standard TFA. The free induc- 
tion decay accumulations, which consisted of 20,480 complex points, was linear 
predicted to 40,560 points, backward linear predicated three points, and apodized 
with 20 Hz Lorentzian filter. All spectral processing was performed with Topspin 
3.2 software. 

Normal mode analysis. Normal modes were calculated using the NOMAD-Ref 
method*!. For all MLL3sp7, MLLIsr, M3RA and M1MRA structures, default 
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parameters in the method were used, including the analysis of ‘all atoms, ‘default 
distance weight parameters for elastic constant’ of 5.0 A, ‘ENM cutoff values’ of 
1A, ‘average RMSD in output trajectories’ of 1.0 A and ‘output’ of the lowest 16 
modes. The first six trivial normal modes are discarded because they represent 
only translation and rotation. The motion patterns under certain mode and angle 
monitoring were achieved by using PyYMOL. 

Molecular dynamics simulation. To delineate how RBBP5-ASH2L modulates the 
dynamic behaviour of MLL spr domain, we performed molecular dynamic simu- 
lations (100 ns) of MLL3gz7 and MLL Iger in the presence or absence of RBBP5- 
ASH2L, respectively. The complex structure of MLLspr with RBBP5—ASH2L were 
centred into a 115 x 115 x 115 A? cubic box and dissolved with TIP3P waters. 0.1 M 
NaCl ions were used to neutralize the net charge of the system. While for the sys- 
tems of MLLspr alone, we just removed the RBBP5-ASH2L from the complex to 
make sure the identical conformations of MLLsgr before performing molecular 
dynamics simulations. The same procedures were used in setting up the MLL ser 
domains without RBBP5-ASH2L except for a smaller cubic box (83 x 83 x 83 A’). 
All molecular dynamics simulations were performed using Gromacs 5.0.4 with 
Charmm36 force field. After the energy minimization of the whole system using 
the steepest descent algorithm, we first gradually heated the system to 300 K under 
NVT condition. Then we equilibrated the solvent and ions around the protein 
using NPT ensemble. In the equilibrations, the backbone of the protein was con- 
strained with a harmonic potential of 1,000kJ mol’. The leap-frog integrator was 
used with an integration time-step of 2 fs. The Berendsen barostat was used to 
control the pressure at 1 bar with a coupling constant of 2 ps and the modified 
Berendsen (V-rescale) thermostat was employed to control the temperature of the 
systems at 300 K with a time constant of 0.1 ps. The Particle Mesh Ewald method 
was used to compute the electrostatic interactions with a real-space cut-off distance 
of 1nm. The same cutoff value was chosen for treating the van der Waals interac- 
tions. After a 5 ns equilibration, we conducted the production molecular dynamics 
by changing the pressure and the thermostat coupling to Parrinello-Rahman and 
Nose-Hoover with coupling constants of 5 ps and 1 ps, respectively. 

The dynamical network analysis of MLL3spy and MLLer were performed 
using networkSetup in VMD. Ca atoms of MLL ser were defined as the node 
domains and the dynamical contact was drawn if two nodes were within a cutoff 
distance of 4.5 A for at least 75% of the molecular dynamics trajectory. The cross 
correlation data were also calculated to weight edges in the dynamical network. 
The edge distances dj, which define the probability of information transfer across 
a given edge: dj = —log(|C;j|), were derived from pairwise correlations (Cj) using 
program Carma. 


To investigate how RBBP5-ASH2L affects methyl transfer process from the 
cofactor AdoMet to the target lysine of the H3 substrate, we performed QM/MM 
simulations to calculate the potentials of mean force for the methyl transfer reac- 
tion along the reaction coordinate (RC) of r(Cy — Ss) — r(Cyy — Ny). Initial struc- 
tures of the QM/MM simulations were derived from the snapshots of molecular 
dynamics trajectories in the presence of AdoMet and H3 substrates, simultane- 
ously. Then each structure was solvated in an equilibrated 25 A spherical water box 
represented by the TIP3P water model. The water box was centred at the centre 
of mass of the target lysine residue accepting the methyl group. In total, 20 atoms 
were selected as the QM zone, including the sulfur atom and the to-be-transferred 
methyl group on the peptide as well as the lysine residue. The simulation was 
performed in NVT ensemble at 300 K. The hybrid QM/MM method was used in 
the simulation. QM interactions are calculated using semi-empirical AM1 method 
and three GHO atoms (C4! and CB, which connect the sulfur atom to the other 
two methyl group, and CD of the lysine residue) were selected as the boundary 
between QM and MM regions. The solvent boundary potential was treated by the 
generalized solvent boundary potential method and all atoms out of the water box 
were fixed. The umbrella sampling method was used to model the reaction process, 
with the reaction coordinate set as the difference between the sulfur atom on the 
peptide and the nitrogen atom on the QM lysine. The whole reaction process was 
distributed into 46 windows and the corresponding reaction coordinate ranged 
from —1.5 to 2.0 A with an interval of 0.1 A. Systems were restrained to each win- 
dow with a force constant of 500 kcal mol! A~?. 


35. Otwinowski, Z. & Minor, W. Processing of X-ray diffraction data collected in 
oscillation mode. Methods Enzymol. 276, 307-326 (1997). 

36. Vonrhein, C., Blanc, E., Roversi, P. & Bricogne, G. Automated structure solution 
with autoSHARP. Methods Mol. Biol. 364, 215-230 (2007). 

37. Adams, P. D. et al. PHENIX: building new software for automated 
crystallographic structure determination. Acta Crystallogr. D 58, 1948-1954 
(2002). 

38. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. 
Acta Crystallogr. D 60, 2126-2132 (2004). 

39. Storoni, L. C., McCoy, A. J. & Read, R. J. Likelihood-enhanced fast rotation 
functions. Acta Crystallogr. D 60, 432-438 (2004). 

40. Shi, P. et al. Site-specific !9F NMR chemical shift and side chain relaxation 
analysis of a membrane protein labeled with an unnatural amino acid. 
Protein Sci. 20, 224-228 (2011). 

41. Lindahl, E., Azuara, C., Koehl, P. & Delarue, M. NOMAD-Ref: visualization, 
deformation and refinement of macromolecular structures based on all-atom 
normal mode analysis. Nucleic Acids Res. 34, W52-W56 (2006). 


© 2016 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a 
100 100 100 
= 75 = 15 = 75 
Fe 50 2 50 2 50 
< 4 4 
25 25: 25: 
ae o & go © .» & . . 
YM PC PSEH FS F YX SC SCPC SCE LS Pw PC CPE ESF 
VFI SE SS ES PSE SS ES SF S VEE SS SF ES 
Fr rw” 8? or 8? i Fe oF oy og? PP Fe oF og? 
ve ee ees PLL ES SS ME LS SE SS 
Po oe Ra aC es PF a ot 
CMM SF Xe SF Yee Ss 
x 
y ve 3 
s RS RY 
100 400. 100 
S 75. 2 75, S75 
£ 2 2 
= 50 2 50 S 50 
o ry Oo 
< g < 
25 25 25 
, . i o 6 6 & oo & 
bv © oo & © » & 2 oy 
SH cay 2? © © SC? Pr yy oo © © © o> © FMR SF SF SF SE SH 
*, e oe Ros oe a vo SSF ir Ss x, e & RS 8 rors 
OF 8 PF og a og? FP SW Wr 
PLL ES S SLES EES SE ELE SS SC 
x x 
IRE A See re 
wo £ & ef XS & ee 
> s e 
s é 3 
MLL1-W-A-R MLL1-A-R MLL1 MLL2-W-A-R MLL2-A-R MLL2 
Un Mono Di Tri Un Mono Di Tri Un Mono Di Tri Un a v Tri Un v v v Un Mono Di Tri 
| . xVe . Y Ye y . Y xV eV s 
1hr 1hr 1 hr 
| 1 hr 
1146 1160 1174 1188 1146 1160 1174 1188 1146 1160 1174 1188 4146 1160 1174 1188 4146 1160 1174 1188 1146 1160 1174 1188 
miz miz miz miz miz miz 
MLL3-W-A-R MLL3-A-R MLL3 MLL4-W-A-R MLL4-A-R MLL4 
Un Mono ] Tri Un Mono Di v Un Mono Di Tri Un Mono Di Tri Un Mono Di Tri Un Mono Di Tri 
Lj y.y. Y V.¥.Y. aye ye Y V.«V VY. VY V.y.y. y «VY eV a 
4 hr 
30 mins 30 mins 20 mins 20 mins thr 
~ u ke le a) i L. 
= ee he prod = 
MsersD ais we nase Peo ar ie> SNS) A100: MAB. 1146 1160 1174 1188 1146 1160 1174 1188 1146 1160 1174 1188 
miz miz miz miz miz 
SET1A-W-A-R SET1A-A-R SET1A -W-A- -A-I 
V “ys Di iTri q T Y v Un Mono Di Tri SEII6 WAR SETB AR ; SETIE . A 
v.V s: y i: a v v v Un Mono Di Tri Un Mono Di Tri Un Mono Di Tri 
Vos VeVe OV OV eWeYs VY. 
4hrs 4hrs 8 hrs | | ; 4hr 
cae il | 
bead k : 
1146 1160 1 1188 1146 1160 Ha 1188 1146 1160 1174 1188 1146 1160 1174 1188 1146 1160 1174 1188 1146 1160 1174 1188 
miz miz miz miz 
c 400. —- MLLI-A-R k=0.48hr 100: 100: 
2 2 — MLL2-A-R k=1.46hr 2 
3 4 -© MLLI-W-A-R k=1.94hr7 3 3 <= MLL3-A-R k=5.34 hr* 
2 = “© MLL2-W-A-R k=2.37hr™ = 
2 50 8 50 B soy MULLS WAR k=4.29 hr’ 
E = = 
5 5 5 
$ 3 3 
x x rd 
0 0 
0.0 05 1.0 45 20 0.0 Os 10 45 20 0.0 0.5 10 41.5 20 
Timeth) Time(h) Timet(h) 
100: 100 400. 
£ ae MLLA-AR | k=13.14 het 2 2 J 4 
3 ; 3 3  SETIBAR = k=3.22 hr 
Pe > MLLA-W-A-R k=12.28 hr? g & 7 : 
3 5 Be SETIB-W-A-R k=2.31 hr 
2 & 50] SETIA-AR = k=0.28 hr? 4 
= 
s 
$ $ > SETIAW-AR k=0.30hr7 S 
x x x 


o 


0.0 


05 1.0 


Time(h) 


Extended Data Figure 1 | Methyltransferase activity of MLL1-MLL4, 
SET1A and SET1B with the different combinations of WDR5, 

RBBP5 and ASH2L. a, HKMT activities determined by the 7#H-methyl- 
incorporation assay. MLL constructs were chosen to contain both the WIN 
motif and the SET domain. Full-length WDR5, RBBP5 and ASH2L were 
used. The HKMT activities are normalized to the activity of the MLL- 
WDR5-RBBP5-ASH2L complexes setting at 100%. Mean +s.d. (n =3) 

are shown. b, Representative MALDI-TOF spectra at different time points 
for MLL complexes and apo MLL proteins clearly revealed that MLL 
complexes have much higher HKMT activities than apo MLL proteins. 
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The peaks for unmodified (un) and mono-, di- and tri-methylated 
products are labelled. The minor peaks are sodium adducts of major 
peaks (+22 Da). Asterisks denote the adduct of un-peaks; filled circles 
denote the adduct of mono-peaks; and filled squares denote the adduct of 
di-peaks. c, Comparison of the overall rates of the methylation reactions 
catalysed by different MLL proteins in the presence of WDR5-ASH2L- 
RBBP5 or ASH2L-RBBP5. The overall rates were derived by fitting the 
decrease in the relative intensity of the unmodified H3 peptide peaks in 
MALDI-TOF mass spectra using one-phase exponential decay model 
[Lys4];= [Lys4]oe™. 
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Extended Data Figure 2 | See next page for figure caption. 
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Extended Data Figure 2 | Interactions between MLL proteins and 
RBBP5-ASH2L. a, GST pull-down assays showed direct interactions 
between MLL proteins and RBBP5-ASH2L. ASH2L C-terminal SPRY 
domain has been previously shown to interact with RBBP5. GST-fused 
ASH2L¢gpry was incubated with full-length RBBP5 and different MLLgpr 
proteins in the GST pull-down assay. Bound proteins were eluted and 
separated by SDS-PAGE. Three different salt concentration buffers were 
tested. b, Fluorescence polarization assay shows that MLL proteins can 
interact with RBBP5,s_agm-ASH2Lsppry with different affinities. For 
MLL1, SET1IA and SET1B, lower limits of the Kg values are reported 
because saturation of the binding could not be achieved in fluorescence 
polarization assays. c, GST-RBBP5 alone cannot pull down MLL proteins 
in the buffer with 300 mM NaCl. d, GST-ASH2Lspry alone cannot pull 
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down MLL proteins in the buffer with 300 mM NaCl. e, The RBBP5- 
ASH2L interaction is highly dependent on the salt concentration used 

in the assay. ITC measurements were carried out using ASH2Lspry and 
RBBP5as-apm under buffer conditions with different salt concentrations. 
f, The requirement of WDRS in methyltransferase activity of the MLL1 
complex is sensitive to protein concentration. MLL1 (5 1M) could be 
markedly stimulated by equal amounts of ASH2L-RBBP5, and WDRS had 
a minor stimulation effect. gs HKMT activities of RBBP5-MLLI fusion 
proteins in the presence of ASH2L or ASH2L and WDRS. Full-length 
RBBP5 was fused to MLLI (residues 3754-3969) with a GGSGGS linker. 
The addition of ASH2L substantially stimulated the HKMT activity of the 
RBBP5-MLLI fusion protein, while further addition of WDRS5 only had a 
marginal effect. 
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MLL3 Molecules 
with H3 peptide 


Es Seat 


Extended Data Figure 3 | The overall structure of the MLL3spy- 
RBBP5as-asm-ASH2Lgpry-H3 complex. a, The overall structure of the 
MLL3¢¢7-RBBP5as-anpm-ASH2Lspry—H3 complex in cartoon diagram. 
ASH2L is in yellow-orange, RBBP5 in cyan, MLL3gpy in salmon, the H3 
peptide in yellow, and cofactor product (AdoHcy) in blue. The electron 
density (2F, — F.) map, contoured at 1a, is shown for the RBBP5 fragment, 
the H3 peptide and AdoHcy. b, The electron density (2F, — F.) map, 
contoured at 1a, is shown around the substrate-binding channel. There 
are two complexes in one asymmetric unit. One complex has clear electron 
density for H3 residues 2-7 (left), while the other exhibits no extra 

density in the substrate channel (right). c, Cofactor interaction network. 
Residues important for the AdoHcy-MLL3¢pr interaction are shown in 
stick models. Hydrogen bonds are indicated by dashed magenta lines. 


MLL3 Molecules 
without H3 peptide 
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d, The space-filling model of MLL3sgr shows that AdoHcy and H3 bind 
to the opposite surfaces on MLL3¢pr. The distance between the sulfur 
atom and ¢-amine of Lys4 is shown. e, The binding interface between 
MLL3¢pr and H3. f, MLL3gpr is in surface representation and coloured 
according to its electrostatic potential. Thr3 of H3 sits snugly on a shallow 
hydrophobic depression, which cannot accommodate residues with a large 
side chain. Arg? is involved in electrostatic interactions with MLL3spr. 

g, Sequence alignment of histone methylation sites. Residues are 
numbered relative to the target lysine. Because only the Lys4 site of H3 
contains a large basic residue and a small residue occupying the —2 and 
—1 positions respectively, Arg(—2) and Thr(—1) define the substrate 
specificity of MLL complexes. 
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WIN Motif 
hMLL1 HLRKSAFDMFNFLASKHRQPPEYNPNDEEEE------ EVQLKSARRATSMD- 3813 
hMLL2 LRKCTFDMFNFLASQHRVLPEGATCDEEED------ EVQLRSTRRATSLE- 2559 
AaTRX Y SNRSEYDMFSWLASRHRKQPIQVFVQPSDN------ ELVP---RRGTGSN- 3572 
hMLL3 KMSAHVKRFVLRPHTLNST----STSKSFQS------ TVTGELNAPYSKOFV 4755 
hMLL4 KILTHYK- ---RPHTLNST----SMSKAYQS------ TFTGETNTPYSKQFV 5381 
daTRR KQROLLVWR--KPHTQRTAGSCSTQRMANSA------ AIAGEVACPYSKQFV 2275 
hSETI1A YYPISKKEKDKY-LDVCPVSARQLEGVDTQG--9---SEQRRLLSAIGTSAI 1552 
hSET1B FYTIDKKDKLRY -LNSSRASTDEPP-ADTQG--18--SEQRRLLSSFTGS-- 1768 
dasET1 FYKLDVREKAKHKYHYAKANTEDSFNEDRSD--25--SNQRRLLTAFGSM-- 1486 
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e 
WGP important for RbBP5-Ash2L binding ([///)) cofactor binding | Zn-binding 


(OP Residues that distinguish MLL1 from other MLL proteins @SET-I Hinge point 
H3 recognition and active center @Phe/Try Switch [i wors-interaction motif 


% Mutations found in Kabuki syndrom ™@ Mutations found in non-Hodgkin lymphoma 


Extended Data Figure 4 | Sequence alignment of MLL homologues interactions are highlighted in magenta. Four Zn-binding cysteine residues 
from human, Drosophila and Saccharomyces cerevisiae. The WDR5- are highlighted in pale yellow. Residues important for cofactor binding are 
interacting motif (WIN) and SET domain are aligned. Secondary in brown; residues important for substrate H3 binding and maintenance of 
structure assignments based on the MLL3 structure are shown as cylinders _ the active centre are in grey. Two glycine residues, which serve as the hinge 
(a-helices) and arrows (3-strands) above the sequences. The WIN motif for SET-I motif rotation, are indicated by blue dots. The residues with 

is coloured in blue, SET-N in green, SET-I in orange, SET-C in purple and the corresponding MLL4 mutations found in Kabuki syndrome and 
post-SET in magenta. Conserved residues important for RBBP5-ASH2L non-Hodgkin lymphoma are indicated by stars. 
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Extended Data Figure 5 | The ternary interaction interface among MLL, 
RBBP5 and ASH2L. a, Mutations of RBBP5 and ASH2L disrupted the 
interaction between ASH2Lspry and RBBP5. Left, GST-RBBP5339_331 was 
used to pull down ASH2Lgpry and its mutants. Right, GST-ASH2Lspry 
was used to pull down full-length RBBP5 and its mutants. Several control 
mutations (such as ASH2L(Q354A) and RBBP5(E347A)), which are not 
on the RBBP5-ASH2L interface, did not affect the interaction between 
ASH2L and RBBP5. b, ASH2L and RBBP5 mutants that disrupted the 
RBBP5-ASH2L interaction decreased the HKMT activities of the MLL3 
complex. The activities of the mutant proteins are normalized to the 
wild-type MLL3-RBBP5-ASH2L complex. Mean + s.d. (n = 3) are shown. 
c, Mutations of RBBP5as residues decreased the HKMT activity of the 


~~ + ASH2L. 


eee —ReePs,, 
ee «GST-MLL3,., 


‘SPRY 


Activity (%) 


50 


MLL1 


MLL2 MLL3 MLL4 SET1A SET1B 


MLL3 complex. d, Representative gel-filtration profiles for MLL and MLL 
mutant proteins indicate MLL mutant proteins have a similar fold to 
wild-type protein. e, GST-MLL3¢er was used to pull down full-length 
RBBP5, ASH2Lgpry and their mutants. Mutations of RBBP5 Glu347 

and ASH2L Gln354 in the ternary interface impaired the interaction 

with MLL3sg7. Mutations of RBBP5,s residues (Phe336Ala, Glu338Ala/ 
Leu339Ala) also decreased the interaction with MLL3sgr to different 
degrees. f, RBBP5(Glu347Ala) and ASH2L(G1n354Ala) compromised the 
HKMT activities of all MLL complexes, indicating that RBBP5-ASH2L 
regulates MLL family proteins through the same mechanism. Activities 
of mutant complexes are normalized to the activity of wild-type MLLsg7- 
RBBP5-ASH2L, setting at 100%. Mean +s.d. (n =3) are shown. 
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Extended Data Figure 6 | Activation mechanism of MLL proteins. a, The 
structure of MLL3gpr is shown in cartoon diagram. The electron density 
(2F, — F.) maps (contoured at 1a) of AdoHcy are shown. b, The structure 
of MLL1¥, is shown in cartoon diagram. The electron density (2F, — F.) 
maps (contoured at 1a) of AdoHcy are shown. c, Structural comparison of 
MLL lser (PDB 2W5Y), MLLI1S1, MLL3ser and MLL4ger (PDB 4ZAP) 
suggests that the SET-I motif is intrinsically flexible, and can be captured 
in different configurations by crystallization. There are two highly 
conserved glycine residues serving as hinge points that connect the SET-I 
motif to the rest of MLL spy. The rotation of helix oB in the SET-I motif 
refers to an axis defined by the two hinge points of SET-I as indicated. 
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d, Dynamic cross-correlation matrix for motions of all Ca atoms in apo 
MLL3ge7 and MLL3¢zrz in the M3RA complex over the course of the 
simulation. The right panel shows enlarged cross-correlation maps of the 
SET-I motif. e, Dynamic cross- correlation matrix for motions of all 

Ca atoms in apo MLLI{i; and MLLIM, in the MIMRA complex over the 
course of the simulation. The right panel shows enlarged cross-correlation 
maps of the SET-I motif. f, The most highly correlated residues of the 
SET-I motif by molecular dynamics simulation are indicated by red lines. 
Left panel is for apo MLL1{)y and right panel is for MLL1%,, in the 
MLLI™RA complex. Red lines are connected Ca atoms for pairs of 
residues with calculated correlation coefficients greater than 0.55. 
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Extended Data Figure 7 | Association of RBBP5-ASH2L increases Tyr4825. These distances are almost fixed in the M3RA complex, while 
the binding affinities of MLL to cofactor and substrate peptide. the distances in apo MLL3 have large variations, explaining why the MLL3 
a, ITC measurement of interactions of AdoMet with MLL3spr alone complex has a higher binding affinity to cofactor than apo MLL3 does. 
(blue) and the M3RA complex (red). The insets show the ITC titration e, The potentials of mean force for the methyl transfer reaction along the 
data. b, Equilibrium dissociation constants between cofactor and MLL reaction coordinate range from —1.5 to 2.0 A with an interval of 0.1 A. It 
proteins obtained from ITC measurements. c, Fluorescence polarization clearly shows that the MLLsg7-RBBP5-ASH2L complex is more energetic 
assay shows that RBBP5-ASH2L increases the binding affinity between favourable for the methyl transfer reaction than MLLspr alone. f, The 
MLL3 and the H3 peptide substrate. d, Molecular dynamics simulation space-filling surface model shows that the Ly4"® binding channel exhibits 
show dynamics of the cofactor binding pocket. Top, the distance between open and closed conformations in the M3RA and M3RA-H3 structures. 


AdoHcy and Tyr4825; bottom, the distance between Arg4845 and 
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Extended Data Figure 8 | A conserved activation mechanism for 
SET-domain-containing HKMTs. a, Structural comparison of MLL3ser 
in the M3RA-H3-AdoHcy complex, and the SET domains of CLR4 (PDB 
IMVH), DIM-5 (PDB 1PEG), EZH2 (PDB 5CH1), ASH1 (PDB 3O0PE) 
and NSD1 (PDB 3001). Histone H3 peptide and AdoHcy in the CLR4 
structure were modelled based on the M3RA-H3-AdoHcy complex 
structure. RBBP5as and the corresponding activation segments in these 
proteins are almost identical in overall conformation (coloured in cyan). 
The recently reported EZH2 complex structure also revealed such an 
activation segment. Most notably, an aromatic residue (shown as stick 
model), equivalent to Phe336 in RBBP5, stacks with another two aromatic 
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DIM-5**S 


residues to form an aromatic cage to sandwich a conserved arginine. 
Another conserved hydrophobic residue (shown as stick model) is also 
important for the stable association between the activation segment and 
the SET-I motif. b, Sequence alignment of the activation segments of 
RBBP5 and several representative HKMTs, including members from the 
SUV39 and SET2 families. c, Gel-filtration profiles and SDS-PAGE for 
DIM-5 and DIM-5AAS show that activation segment does not affect 
protein folding. DIM-5AAS denotes DIM-5 (residues 51-319) that does 
not contain the activation segment. d, HKMT activities of DIM-5 and its 
mutants. Activities of mutant proteins are normalized to the activity of the 
wild-type protein setting at 100%. Mean + s.d. (n= 3) are shown. 
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Extended Data Table 1 | Data collection and refinement statistics for MLL3s¢7 and the MLL3se7-RBBP5as-asm—ASH2Lspry complex 


Data collection 
Space group 
Cell dimensions 
a, b, c (A) 

a, B, y (°) 
Wavelength (A) 
Resolution (A) 
Rmerge 

I /ol 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
No. reflections 
Rwork/| Riree (%) 
No. atoms 
Protein 
Ligand 
Water 
B-factors (A’) 
Protein 
Ligand 
Water 
R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


MLL3set Native ' MLL3set Peak 
(Zn-SAD)! 

P4132 P4132 

129.056 129.323 

90 90 

0.9793 1.2818 

50-2.8 . 50-3.4 

0.135(0.530) 0.204(0.563) 

31.6(7.0) 31.0(12.3) 

99.9(100) 99.9(100) 

10.1(10.6) 13.3(13.9) 

38.9-2.8 

9521 

18.0/22.9 

1198 

27 

52 

36.22 

33.13 

32.30 

0.011 

1.039 


*Values in parentheses are for the highest-resolution shell. 


tThe data are collected from one crystal. 


MLL3se1-RBBP5as-agu- 
ASH2Lspry* 


P24212 


80.342,236.076,44.416 
90, 90, 90 

1.1272 

100-2.4 

0.110 (0.654) 

32.6 (5.1) 

99.9(100) 

7.1 (7.3) 


44.4-2.4 
33458 
18.0/22.8 
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Extended Data Table 2 | Data collection and refinement statistics for MLL1g22°1!/ 03867! and MLL1g¢P01!/ 23867!_RBBP5as.asm-ASH2Lspry 
complex 


N38611/Q3867L_ 
MLL1 


MLL1 N38611//Q3867L -RBB Picea 


SET SET 
ASH2Lgpry' 

Data collection 
Space group P3221 C2 
Cell dimensions 
a, b, c (A) 54.547,54.574,104.656 74.966, 44,410, 117.792 
a, B, y (°) 90,90,122 90,106.157,90 
Wavelength (A) 0.9785 0.9792 
Resolution (A) 50-1.8 . 50-1.9 
Rmerge 0.090(0.421) 0.158 (0.548) 
I /ol 36.5(4.1) 11.3(2.4) 
Completeness (%) 100(100) 99.7(99.8) 
Redundancy 9.6(9.9) 3.6 (3.3) 
Refinement 
Resolution (A) 28.1-1.8 37.5-1.9 
No. reflections 17284 29678 
Rwork/Riree (%) 20.2/23.6 16.6/21.3 
No. atoms 

Protein 1156 2804 

Ligand 27 27 

Water 142 361 
B-factors (A’) 

Protein 45.34 20.31 

Ligand 33.19 27.14 

Water 45.77 29.99 
R.m.s deviations 

Bond lengths (A) 0.011 0.008 

Bond angles (°) 1.105 1.044 


{The data are collected from one crystal. 


«Values in parentheses are for highest-resolution shell. 
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The host galaxy of a fast radio burst 


E. E Keane!’, S. Johnston‘, S. Bhandari*, E. Barr?, N. D. R. Bhat*°, M. Burgay®, M. Caleb?+’, C. Flynn’, A. Jameson?”, 
M. Kramer®.”, E. Petroff?**, A. Possenti®, W. van Straten’, M. Bailes*, S. Burke-Spolaor!, R. P Eatough®, B. W. Stappers’, 
T. Totani!, M. Honma!*’, H. Furusawa”, T. Hattori!*, T. Morokuma!*®, Y. Niino!’, H. Sugai!®, T. Terai!4, N. Tominaga!®!’, 

S. Yamasaki", N. Yasuda!®, R. Allen’, J. Cooke”, J. Jencson!®, M. M. Kasliwal'®, D. L. Kaplan”, S. J. Tingay*, A. Williams®, 
R. Wayth*», P. Chandra”®, D. Perrodin®, M. Berezina®, M. Mickaliger® & C. Bassa”! 


In recent years, millisecond-duration radio signals originating in 
distant galaxies appear to have been discovered in the so-called 
fast radio bursts'~°. These signals are dispersed according to a 
precise physical law and this dispersion is a key observable quantity, 
which, in tandem with a redshift measurement, can be used for 
fundamental physical investigations!”!'. Every fast radio burst has 
a dispersion measurement, but none before now have had a redshift 
measurement, because of the difficulty in pinpointing their celestial 
coordinates. Here we report the discovery of a fast radio burst and 
the identification of a fading radio transient lasting ~6 days after 
the event, which we use to identify the host galaxy; we measure the 
galaxy’s redshift to be z= 0.492 + 0.008. The dispersion measure 
and redshift, in combination, provide a direct measurement of the 
cosmic density of ionized baryons in the intergalactic medium of 
Qiem= 4.9 + 1.3 per cent, in agreement with the expectation from 
the Wilkinson Microwave Anisotropy Probe”, and including all 
of the so-called ‘missing baryons. The ~6-day radio transient is 
largely consistent with the radio afterglow of a short +-ray burst', 
and its existence and timescale do not support progenitor models 
such as giant pulses from pulsars, and supernovae. This contrasts 
with the interpretation® of another recently discovered fast radio 
burst, suggesting that there are at least two classes of bursts. 

As part of the SUrvey for Pulsars and Extragalactic Radio Bursts 
(SUPERB) project at the Parkes radio telescope, we perform real-time 
searches of the sky at high time resolution (see Methods). On 2015 
April 18 utc we detected a fast radio burst (FRB) which we refer to as 
FRB 150418. Its dispersion measure (DM) is 776.2(5) cm~ “pc, which is 
4.1 times the maximum Galactic contribution expected from this line of 
sight through the Milky Way'*. (Throughout, the figure in parentheses 
indicates the uncertainty in the last digit.) The urc at which the FRB 
was detected is 04:29:07.056 at 1,382 MHz, or 04:29:05.370 at infinite 
frequency with the dispersion delay removed (Fig. 1). The observed 
pulse width is 0.8 + 0.3 ms, consistent with dispersion smearing due to 
the finite frequency resolution of the spectrometer, indicating that the 
intrinsic width is unresolved. No scattering is evident, consistent with the 
expected contribution from the Galaxy, which is <1 ms at this latitude’. 
The linear polarization is not large at 8.5 + 1.5% and the circular polariza- 
tion is consistent with zero. Given the low level of linear polarization the 
rotation measure is not known precisely and is 36 +52radm~. In 13h 
of follow-up observations no repeat burst was detected (see Methods). 


Upon detection of FRB 150418 at Parkes, a network of telescopes was 
triggered across a wide range of wavelengths (see Methods). Beginning 
two hours after the FRB, observations with the Australia Telescope 
Compact Array (ATCA) were carried out at 5.5 GHz and 7.5 GHz, 
identifying two variable compact sources. One of the variable sources is 
consistent with a GHz-peaked-spectrum source, with a positive spec- 
tral index, as previously identified in observations at these frequen- 
cies!®, The other variable source (right ascension, RA 07h 16 min 34.6 s; 
declination, dec. —19° 00’ 40”), offset by 1.944 arcmin from the centre 
of the Parkes beam, was seen at 5.5 GHz at a brightness of 0.27(5) mJy 
per beam just 2h after the FRB. The source was then seen to fade over 
subsequent epochs, settling at a brightness of ~0.09(2) mJy per beam 
(Fig. 2). The source is also seen at 7.5 GHz at 0.18(3) mJy per beam 
in the first epoch but subsequently not detected. These observations 
indicate a ~6-day transient with a negative spectral index; we obtain 
a= ~—1.37 in the first epoch, for a power-law spectrum of the form 
F,xv. The subsequent quiescent level is consistent with the level 
expected'’ from an early-type galaxy at z~ 0.5. To estimate the likeli- 
hood that this transient could occur by chance we consider the results 
of radio imaging surveys (see Methods). By comparing to a recent 
survey with the Very Large Array'® in the 2-4 GHz band, we expect a 
95% (99%) confidence upper limit of <0.001 (<0.002) such transients 
to occur in the ATCA observations of the FRB field, or equivalently an 
upper limit chance coincidence probability of <0.1% (<0.2%). We find 
that the detection of a fading transient source is therefore sufficiently 
rare that we conclude that it is the afterglow from the FRB event. 

We obtained optical observations of the field using Suprime-Cam 
on the 8.2-m Subaru Telescope on 2015 April 19 and April 20. From 
these images, we identify a source within the ~1-arcsec positional 
uncertainty derived from the ATCA image (Fig. 3). The source is seen 
to be a galaxy with a half-light radius of 1.40.2 arcsec with a surface 
brightness profile well fitted by de Vaucouleurs’ law (see Methods), 
thus consistent with being an elliptical galaxy. In addition to the Subaru 
r’ and i’ band photometry, we obtained J, H and K, band photometry 
with the Wide-field Infrared Camera on the Palomar 200” telescope. 
The source is also detected in the WISE W1 and W2 filters, enabling a 
seven-band spectral energy distribution fit (see Extended Data Fig. 1) 
and the determination of a photometric redshift of 0.48 <z< 0.56 
(68% confidence). To confirm this redshift, we performed a 3-h spectro- 
scopic observation in good seeing conditions (full-width at half-maximum, 
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FWHM #028 arcsec) using the Faint Object Camera and Spectrograph 
(FOCAS) on Subaru on 2015 November 03. This yielded a spectrum con- 
sistent with a reddened elliptical galaxy at z= 0.492 +0.008 (Fig. 3). An ear- 
lier 1-h observation, on a night which was not spectrophotometric, using 
the Deep Imaging Multi-Object Spectrograph (DEIMOS) on the Keck 
telescope, was also taken and found to be consistent with the Subaru result. 

Dispersion in the intergalactic medium is related to the cosmic 
density of ionized baryons (jg and the redshift!?”° according to the 
following expression: 


— 3cHo Mem f2 (1 +2/)f (2’)dz’ 


DMicm= eee eee eh ee 
IGM 8nGmy MN [(1+z’PQa+ Qy)°? 


(1) 


Here, we take DMjgy = DMgrg — DMyw — DMhalo — DMhost(1 +z)! 
(see Methods). It is appropriate to account for a Milky Way halo con- 
tribution”! of DMnalp = 30cm pe and we derive 189cm~* pc for 
DMmw (the DM for the rest of the Milky Way, excluding its halo, 
that is, the non-dark-matter component of the Milky Way) from the 
NE2001 Galactic electron density model’. An elliptical galaxy can be 
modelled” with a modified version of NE2001 with an average rest-frame 
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Figure 2 | The FRB host galaxy radio light curve . Detections have lo 
error bars, and 3c upper limits are indicated with arrows. The afterglow 
event appears to last ~6 days, after which time the brightness settles at the 
quiescent level for the galaxy. Here myp denotes modified Julian day. 
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Figure 1 | The FRB 150418 radio signal. 

a, A waterfall plot of the FRB signal 

with 15 frequency sub-bands across the 

Parkes observing bandwidth, showing the 
characteristic quadratic time-frequency sweep. 
To increase the signal-to-noise ratio, the 

time resolution is reduced by a factor of 14 
from the raw 64-1s value. b, The pulse profile 
of the FRB signal with the total intensity, linear 
and circular polarization flux densities shown 
as black, purple and green lines respectively. 

c, The polarization position angle is shown with 
1c error bars, for each 64-1s time sample where 
the linear polarization was greater than twice 
the uncertainty in the linear polarization. 


value of ~37cm~* pc. The NE2001 components are uncertain at the 20% 


level'4 


, and there is an additional uncertainty of ~100 cm~? pc owing to line- 


of-sight inhomogeneities in the intergalactic medium!°. We therefore obtain: 


Mom= [2 loos +0.013 


0.88 


(2) 


e 


The ionization factor is f.=1 for fully ionized hydrogen, whereas 
allowing for a helium abundance of 25%, f, = 0.88 is appropriate”. 


Table 1 | Summary of FRB 150418 observed and derived properties 


Event time at 1,382 MHz 


Event time at infinite frequency 


Parkes beam number 
Beam FWHM diameter 
Beam centre (RA, dec.) 
Beam centre (/,b) 
Galaxy position (RA, dec.) 
ATCA epoch 1 

Galaxy position (/,b) 
ATCA epoch 1 
Signal-to-noise ratio 
Observed width, Wops 
FRB dispersion measure 
Dispersion index, 3 


Milky Way dispersion measure 


Redshift, z 
Peak flux density, S1382muz 


Fluence, Fy3gomH2z Jyms) 


Linear polarization, L// 
Circular polarization, |V|// 
Rotation measure 
Spectral index, a 


Comoving distance 
Luminosity distance 
Energy 

Luminosity 


2015 April 18 04:29:07.056 utc 
2015 April 18 04:29:05.370uTC 

4, inner ring 

14.1’ 

07h16min30.9s, —19°02/24.4” 
232.684°, —3.261° 
07h16min35(3)s, —19°00'40(1)” 


232.6654(1)°, -3.2348(3)° 


39 

0.8+0.3ms 

776.2(5) cm~? pce 

—2.00(1) 

188.5 cm-3pc 

0.492(8) 

>2.2*8§ Jy (beam centre) 
>2.4*85 Jy (galaxy position) 
1.9*3:4 Jyms (beam centre) 
2.0133 Jyms (galaxy position) 
8.541.5% 

<4.5% (30) 

36+52radm-* 

a> —3.0 (30, Parkes-MWA) 
a=+1.3+0.5 (Parkes) 

1.88 gigaparsecs 

2.81 gigaparsecs 

8t} x 1038 ergs (galaxy position) 
>1.3 x 10*4 ergs s~! (galaxy position) 


The peak flux density is a band average and a lower limit due to the intrinsic width not being 
resolved; similarly luminosity is a lower limit. The energy quoted is the product of the band- 
averaged fluence, the blueshifted effective bandwidth of the observations and the square of the 
luminosity distance. MWA denotes the Murchison Widefield Array. 
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Figure 3 | Optical analysis of the FRB host galaxy. a, A wide-field 
composite false-colour RGB (red-green-blue) image, overplotted with 
the half-power beam pattern of the Parkes multi-beam receiver. Panels 
b and c show successive zooms on the beam 4 region, and on the fading 
ATCA transient location respectively. P200 Ks denotes the Palomar 
telescope 200-inch K, band. Panel d is further zoomed in with the error 
ellipses for the ATCA transient, as derived from both the first and 


From fitting ACDM cosmological models to Wilkinson Microwave 
Anisotropy Probe (WMAP) observations one derives!” the cosmic 
density of all baryons Qaryons = 0.046 + 0.002. Of these, about 10% 
are not ionized or are in stellar interiors” so that we expect to meas- 
ure a cosmic density of ionized baryons in the intergalactic medium 
of Qgm 0.9 X Aparyons ¥ 0.041 + 0.002. Thus, our measurement 
independently verifies the ACDM model and the WMAP observa- 
tions, and constitutes a direct measurement of the ionized material 
associated with all of the baryonic matter in the direction of the FRB, 
including the so-called “missing baryons”™*. Alternatively, if we take 
21cm = 0.041, our measurements show that DMhos is negligible. 
FRB localization allows us to correct a number of observable quantities 
that are corrupted by the unknown gain correction factor owing toa lack 
of knowledge of the true position within the telescope beam. Accounting 
for the frequency-dependent beam response”° we can derive a true spectral 
index for the FRB. We obtain a=+1.3£0.5 for a fit centred at 1.382 GHz. 


second epoch, overplotted. e, The Subaru FOCAS spectrum de-reddened 
with E(B — V) = 1.2, smoothed by five pixels and fitted to an elliptical 
galaxy template at z= 0.492, denoted by the blue line. Common atomic 
transitions observed in galaxies are denoted by vertical dashed lines 

and yellow lines denote bright night sky lines. The Subaru r’ and 7’ filter 
bandpasses are denoted by light red and grey background shading. The red 
line is the 1a per pixel uncertainty (not smoothed). 


Similarly, we derive a corrected flux density and fluence, and these, in com- 
bination with the redshift measurement, enable us to derive the distances, 
the energy released, the luminosity and other parameters (Table 1). 

In considering the nature of the progenitor we first consider the host 
galaxy. An upper limit to the star-formation rate can be determined 
from the upper limit Ha luminosity indicated by the Subaru spectrum 
(see Methods) to be <0.2Meyr~', where Me is the solar mass. Such 
a low star-formation rate implies, in the simplest interpretation, that 
FRB models directly related to recent star formation such as magnetar 
flares or blitzars are disfavoured. This problem might be avoided if either 
some residual star formation occurred in an otherwise ‘dead’ galaxy, 
or if the FRB originated in one of the many satellite galaxies that are 
expected to surround an elliptical galaxy at this redshift, but that cannot 
be resolved in our observations. Failing these, the lack of star formation 
favours models such as compact merger events. This may be supported 
by the existence of the ~6-day radio transient, which we interpret as 
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the afterglow from the FRB. The presence of this short-duration radio 
afterglow would not be expected from models involving giant pulses 
from pulsars (which last only a few nanoseconds), whereas supernovae 
and tidal disruption events would be expected to have a longer-lasting 
afterglow”®. The afterglow is consistent with the four short +\-ray bursts 
for which radio afterglows have been detected”’. The stellar mass of 
the galaxy, derived from the optical photometry and spectroscopy (see 
Methods), is ~10!!Mo, which is in accord with the masses of elliptical 
host galaxies of short >\-ray bursts”®. For a duration of 6 days for the after- 
glow we infer a brightness temperature of ~10'*K, above the Compton 
cooling limit of 10'?K, implying a modest Doppler boosting factor. It 
is possible that the initial variability in brightness and spectral index is 
caused by scintillation, similar to that seen in the light curves of -+-ray 
bursts”’. If so, this indicates a very compact initial source size. 

Our conclusion that FRB 150418 is likely to be from a one-off event 
in an older stellar population may be at odds with the recent discovery® 
of FRB 110523. It is possible that there are two or more classes of FRB 
progenitor. These FRBs differ in their observed pulse widths and we 
speculate that this parameter might act as a discriminator between 
differing progenitors. FRB 150418 is not resolved (in time), but 
FRB 110523 appears to be (see Methods). This trend is seen in the 
published FRB population: some are clearly resolved, some are clearly 
not, although extra care needs to be taken in re-analysing this, as very 
slight errors in DM, as well as the effect of multi-path scattering, can 
make an FRB appear to have a longer intrinsic timescale. 

Unresolved events like FRB 150418 might thus be ascribed to cataclys- 
mic events (such as short +\-ray bursts), while events like FRB 110523, 
where the intrinsic timescale is ~1 ms, could be associated to magnetar 
flares*°. If so, events like FRB 150418 are unique events, whereas events 
like FRB 110523 could repeat, although the timescale for this is unclear. To 
answer these questions, and others such as whether or not some or all FRBs 
are standard candles, this study will need to be repeated for a large number 
of FRBs, and requires continued monitoring of the known FRB fields. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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redshift and produced the optical spectrum plot. J.J. performed the Palomar 
observations. M.M.K. performed the Keck observation. MWA observations were 
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DM contributions. When radio signals propagate through a plasma, the travel 
time is longer than the light travel time in a vacuum. The additional delay depends 
on the radio wave frequency, f, in units of GHz, and obeys a precise physical law 
of the form: tpy=4.149 ms x (DM x f~?), where the DM is the integrated elec- 
tron density along the line of sight in units of cm™ pc. If travelling across cosmo- 
logical distances there are several contributions to the observed DM—from the 
host, the intergalactic medium and the Milky Way. The quantity of interest is the 
intergalactic medium component because this can be used, in conjunction with 
redshift measurements, to perform a number of fundamental studies, for exam- 
ple, detecting the missing baryons!°, determining the dark energy equation of 
state’! and, if the signal is linearly polarized, measuring magnetic field strengths 
in the intergalactic medium*!. For these applications the DM contributions other 
than those from the intergalactic medium are essentially foregrounds which must 
be understood so that they can be removed. Here we examine each contribution 
in turn and note the uncertainties in each case; it is important to note that the 
precision in the observed DM value is high (see Table 1) and does not therefore 
contribute to the uncertainty in determining the DM for the intergalactic medium. 

The Milky Way component consists of two parts, the first of which is that due to 
the interstellar medium. This is in principle known", because the NE2001 model 
of the Galaxy’s electron density can determine this to an accuracy of approximately 
20%. The second Milky Way component is the contribution from the dark matter 
halo, which is thought to exist, yet which is not included in the NE2001 model. We 
follow previous work”, which has calculated this term to be 30cm™ pc. It is unclear 
how to assign an uncertainty to this component so (considering that the other 
components are dominant regardless) we take it to be zero. The host component 
is suppressed by a factor of (1-+z)'. Its magnitude depends on both the nature of 
the progenitor and of the host galaxy. 

The observed FRB signal can be used to constrain the progenitor’s local DM 
contribution—a non-negligible contribution could imply a high density of elec- 
trons to be located close to the source. Such a high density configuration®” could 
produce higher-order dispersion terms (that is, the pulse’s frequency dispersion 
would deviate from the quadratic form), could result in plasma frequencies 
comparable to the emitted frequency (that is, radiation would not escape), and 
could produce scattering in the pulse profile. None of these effects are observed 
for FRB 150418, implying that any local-progenitor DM component is negligible. 

The host galaxy contribution has been examined recently” for spiral, dwarf and 

elliptical galaxies. This work considered modified versions of NE2001, with various 
sub-components of the model included or excluded as appropriate and suitable 
scalings to the Ha luminosity applied. For an elliptical galaxy, which is relevant in 
the case of FRB 150418, the average DM contribution over all inclination angles 
is 37 cm pc (this is the value before being suppressed by the (1 +z)! factor) and 
we use this as our estimate of the host contribution. As this is based upon NE2001 
we assume that a 20% uncertainty applies. In addition to the uncertainties men- 
tioned already the intergalactic medium component itself is uncertain at the level 
of ~100cm™ pc, owing to inhomogeneities between different lines of sight through 
the intergalactic medium”. 
SUPERB. SUPERB is a project ongoing at the 64-m Parkes radio telescope since 
April 2014 with goals of discovering FRBs and pulsars. The central frequency of 
the survey is 1.382 GHz, with a bandwidth of 400 MHz, of which about 340 MHz 
is typically usable. It uses optimized graphics processing unit codes for performing 
real-time radio frequency interference mitigation and searches for short-duration 
radio bursts and pulsars in relativistic binary systems. In the real-time search we 
use the following criteria to define candidate FRB events: (1) the DM of the burst 
must be at least 1.5 times the expected maximum Milky Way contribution; (2) 
the signal-to-noise ratio must be at least 10; (3) the signal cannot be detected in 
more than four beams of the 13-beam receiver used for the SUPERB project at 
Parkes—an event detected in more beams cannot originate from a boresight signal 
and therefore cannot be of a celestial origin; (4) the width must be less than or 
equal to 8.192 ms, that is, 128 times our native time sampling of 641s; and (5) the 
number of independent events detected in a 4-s window centred on the event in 
question must be no greater than 5. The lag between the FRB signal hitting the dish 
and our software informing us of the detection’ is only ~10s. We further search 
the data, offline, with more stringent interference rejection and covering corners 
of parameter space ignored for expediency during the real-time search. We note 
that since instigating this search system at Parkes no FRB has been missed by the 
real-time search pipeline, including FRB 150418. 

FRB 150418 was detected in beam 4 of the 21-cm multi-beam receiver. The FRB 
profile was fitted simultaneously for time of arrival, DM, width, amplitude and 
dispersion index using 4 and 8 different sub-bands. The results were consistent with 
an unresolved pulse, where the width is purely given by the dispersion smearing 
across the 390.625-kHz filterbank channels. Uncertainties were determined using 
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CERN’s MINUIT packages (http://seal.web.cern.ch/seal/snapshot/work-packages/ 
mathlibs/minuit/). The burst was found to have a DM of 776.2(5) cm“? pe, anda 
dispersion index of = —2.00(1), where the dispersion delay is proportional to v’, 
and so is consistent with propagation through a cold plasma. 

The gain of beam 4 is well fitted by a Gaussian’ with FWHM of 14.1 arcmin, so 
to derive corrected values for the flux density, fluence, and so on we boost the meas- 
ured values by a factor of exp(In2(20/FWHM)’), where 0 is the offset of the signal 
from the beam centre. The offset between the ATCA position determined from the 
first epoch (RA 07h 16 min 34.557 s, dec. —19°00/ 39.954”) and the centre of the 
Parkes beam (RA 07h 16 min 30.9s, dec. -19°02/ 24.4”) is 0= 1.944 arcmin, yield- 
ing a boost factor of 1.054. The observed peak flux density, if the FRB were at the 
centre of the beam, is 2.2 Jy with a corresponding fluence of 1.9 Jy ms; correcting 
these to the location of the host galaxy, we estimate values of 2.4 Jy and 2.0 Jy ms. 

A calibration observation was taken 17 min post-burst and the polarization 
calibration was performed using the PSRCHIVE software package*’. On the basis 
of observations of PSRJ1644-4459, a bright polarized pulsar which we use to 
calibrate the off-axis response, taken three days before the FRB, we determine 
that the difference in the Jones matrix coefficients is not statistically different off- 
boresight. Therefore the boresight calibration was used to determine the polariza- 
tion fraction of the pulse. This FRB is not seen to have a large linear polarization 
L, the rotation-measure-corrected linear polarization is L/I= 8.5 + 1.5% (where 
Tis total intensity) and the circular polarization is consistent with zero. An addi- 
tional systematic uncertainty exists in the leakage of total intensity to polarization. 
Our analysis of PSR J1644—4459 provides an upper limit on the magnitude (but 
not orientation) of the leakage vector that is <6% of the total intensity, meaning 
that the true L/I value may be either smaller or larger than quoted by up to this 
amount. Owing to the low linear polarization the rotation measure estimate is not 
very precise at 36 + 52 rad m™~. As the rotation measure is consistent with zero, 
examination of the intergalactic medium magnetic field strength is not possible 
with this FRB, although for completeness we note that the 3c upper limit on the 
electron weighted intergalactic medium magnetic field strength is ~0.4 1G for 
this line of sight. There is no evidence for a large host contribution to the rotation 
measure for this FRB, although we note that an extremely large rotation measure 
exceeding 10°radm~* would result in depolarization within a single frequency 
channel, meaning we are insensitive to such large values. 

The Murchison Widefield Array (MWA)** was shadowing our Parkes obser- 
vations but did not detect a counterpart. The resulting 30 fluence upper limit of 
1,050 Jy ms at 185 MHz gives us the first simultaneous multi-frequency observation 
of an FRB, and hence the first broadband limit on the spectral index. The spectral 
index limit from the Parkes and MWA data combined is a > —3.0. Properties of 
the FRB are summarized in Table 1. 

Follow-up observations. After the discovery of the FRB we triggered observations 
at numerous telescopes and performed a calibration observation at Parkes. We 
continued to observe with the Parkes telescope, obtaining 4.5 h of observation over 
the course of the next 7.5h, in order to search for any repeat bursts. The MWA 
was shadowing during the discovery observation and continued to track the FRB 
position for another ~7.5h. The ATCA was on source 2h after the burst and also 
observed until T+ 7.5 h (where T is the time of the FRB) when the source set at 
both Parkes and ATCA. Swift was on source 8 h after the burst, and 10 h after the 
burst the Lovell telescope continued the monitoring for 2.5h. On April 19 and 20 
we obtained optical observations with Subaru, and on April 20 and 21 continued 
to search for repeated radio bursts with the Effelsberg, Sardinia and Parkes radio 
telescopes. The longer term follow-up campaign consisted of radio imaging (four 
further ATCA epochs, three GMRT epochs), high time resolution radio (with 
the Lovell telescope), X-ray (one further Swift epoch), optical photometry (with 
the Palomar telescope) and optical spectroscopy (with the Keck and Subaru tele- 
scopes). We did not detect any subsequent bursts in our high time resolution radio 
follow-up (limiting flux densities in Extended Data Table 1); however, regular 
emission at a much weaker level cannot be ruled out. The follow-up observations 
are summarized in Extended Data Table 1. We additionally note that no +-ray 
burst was detected in temporal coincidence, or in the months before, by telescopes 
on either of the Fermi or Swift satellites. Furthermore, at a comoving distance of 
1.8 gigaparsecs or a luminosity distance of 2.8 gigaparsecs, this galaxy is beyond the 
LIGO* horizon for gravitational wave signatures from short -\-ray bursts. 

Imaging transients. The radio transient sky is not very well studied at frequencies 
of 5.5GHz and 7.5 GHz to the flux density levels relevant to this study. Additionally 
there are no archival data of the FRB field with which to compare our follow-up 
observations. To estimate the likelihood of the 6-day fading transient being 
detected by chance in our ATCA follow-up of the Parkes FRB field we first con- 
sidered a previous ATCA study’®. This is the only such work performed on the 
same timescales and at the same observing frequency using the ATCA. In that 
work, which also covered a wider area of sky than our FRB follow-ups and to a 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


deeper level, no transient sources were discovered. A 95% (99%) confidence upper 
limit event rate of <7.5 deg? (<11.1 deg”) with a flux density of >69 ,,Jy/beam 
was obtained. Scaling this to obtain the expectation of transient with flux density 
in excess of 200 \.Jy yields an upper limit event rate of <1.5 deg (<2.2 deg). 
Considering the 0.04-deg” field-of-view of a Parkes beam, which corresponds to 
the uncertainty in the FRB’s position, the upper limit on the expected number of 
events in our follow-up observations is thus <0.06 (<0.09). We can rephrase this 
by interpreting the upper limit number of expected events as the upper limit on , 
the Poisson rate parameter; then the probability of a chance temporal coincidence 
is P(1; \) = Aexp(—A). This yields an upper-limit probability of <6% (<8%). 

One can also obtain estimates of the false-positive rate from considering 
studies**“” at other telescopes. As other studies are not ideally matched in terms 
of observing frequency and sensitivity, one must scale the findings appropriately. 
For example, for studies performed at different observing frequencies we must 
scale the flux densities by a spectral index; we adopt the spectral index of our 6-day 
transient as measured in the first epoch of our follow-up. Additionally, sensitivity 
levels must be scaled to the 200-1Jy level; for this operation we adopt the standard 
N«S~*” scaling. With this approach we consider a recent deep Very Large Array 
study! which operated at 2-4 GHz. Applying the appropriate scaling, this study 
yields an expected number of events in our follow-up observations of <0.001 
(<0.002) at 95% (99%) confidence. The equivalent upper limit chance temporal 
coincidence probability is <0.1% (<0.2%). This result is more constraining than 
the ATCA-derived numbers by a factor of at least 60. From this assessment we 
deem it statistically unlikely that we would have detected, by chance, this fading 
negative spectral index radio source at this location and time, resulting in our 
interpretation that this source is likely to be associated with the FRB. 

Ideally, we might expand upon this calculation by estimating probabilities of 
chance coincidence in each independent wave-band (radio, optical and X-ray) in 
our follow-up campaign, and then compute a joint probability of a transient occur- 
ring in any of the bands. It is unclear what the appropriate statistics are for the X-ray 
and optical bands, but if we take the observed +\-ray burst and supernova rates as 
indicative, we find that the upper limit expectation in these wave bands is much 
smaller than in the radio bands. Deeper all-sky radio transient surveys are therefore 
the key to tightening constraints on transient associations like that reported here. 
Optical analysis. The i’ optical profile of the Galaxy was fitted with a Sersic func- 
tion of the form I(r) « exp(—kR"”), The best-fit parameters (see Extended Data 
Fig. 2) are a Sersic index of n= 3.6 +0.5, consistent with the n= 4 value seen in 
elliptical galaxies. The half-light radius is R. = 6.9 + 1.2 pixels, or Re = 1.4+0.2 
arcsec. At z= 0,492 the angular diameter distance is 1.62 gigaparsecs so that this 
implies Re= 10.9 + 1.8 kpc as the physical half-light radius of the galaxy. The pro- 
file fitting also yields an estimate of the major to minor axes ratio for the galaxy 
of b/a=0.68 + 0.03. We note that there are additional systematic uncertainties 
involved in the fit of the Sersic index, depending on the exact method of sky sub- 
traction employed; our analysis suggests that the systematic errors are probably of 
equal magnitude to the statistical errors quoted above. 

We obtained the following photometry of the FRB host galaxy: with Subaru 
Suprime-Cam*! we determined AB magnitudes of 23.45(16) and 22.07(31), for 
the r’ and i’ bands respectively. Between the two Subaru epochs no variability is 
seen—a subtraction of the epochs yields an upper limit on any variation of 25.2 and 
24.7 magnitudes (5c). The galaxy is also detected in two WISE filters with Vega 
magnitudes of 15.204(0.044) and 15.050(082), for the W; and W> bands respec- 
tively. With the Palomar 200” (P200) telescope we obtained further information, 
obtaining Vega magnitudes of 18.92(10), 17.55(25) and 16.51(05) for the J, H and 
K; bands respectively. We corrected the observed photometry for the Milky Way 
extinction using Ay =3.7 mag and standard extinction coefficients”, and converted 
those magnitudes into flux densities using the established zero-points***, We 
fitted for the photometric redshift using the 2015 November version of EAZY*, 
finding 68% confidence limits for 0.48 < z< 0.56. This was robust to different 
choices of galaxy template, with good overall fits (” of 5-8). We then fitted the 
spectral energy distribution of the host galaxy using MAGPHYS**. We were able to 
achieve an acceptable fit (see Extended Data Fig. 1) to all of the photometry with a 
model for a passive (<0.2Mo yr‘), massive (stellar mass ~10!'Mo) galaxy with a 
modest amount of dust (in-host extinction Ay in the range 0-4 mag). The exact fit 
was rather degenerate because of our limited wavelength coverage, with only the 
stellar mass well determined. Future observations at shorter wavelengths should 
be able to determine more robust properties. 

The spectrum that confirmed the redshift was obtained on 2015 November 2 
in a 3-h observation using FOCAS on the Subaru telescope. An earlier attempt to 
obtain a spectroscopic redshift on 2015 October 21 using DEIMOS on the Keck 
telescope in poorer sky conditions had proved difficult to calibrate and resulted 


in an imprecise redshift estimate, no better than the photometric estimate. In 
that case difficulties in calibration are compounded by the spectrum’s lack of any 
relevant emission lines. The well calibrated Subaru spectrum is found to be con- 
sistent with a reddened z= 0.492(8) early-type galaxy with E(B —V)=1.2+0.1 
(Fig. 3), noting that r’ and i’ are approximations to restframe B and V filters. 
This implies absolute magnitudes of Mg ¥ —21.6 and My + —22.1. As the galaxy 
is elliptical we can apply the Faber—Jackson relation’” to estimate the velocity 
dispersion of ~230kms '. From the virial theorem and the observed half-light 
radius we can thus estimate the stellar (and total) mass** to be ~10!'Mo (and 
~2 x 10'2M,j). An upper limit to the Ha luminosity of 2.6 x 10° erg s-! (3c) can 
be derived from the optical spectrum, and from this we can, in the standard way”, 
derive a star-formation rate of <0.2Meyr!. 

The very faint companion galaxy (r’ = 24.22(16) mag, i’ = 23.22(31) mag) visible 

to the northeast is not inconsistent with a galaxy at the same redshift. If confirmed 
the two galaxies may be in the process of merging. The absolute K-band magnitude 
of the galaxy is —25.7 mag (Vega). The radio continuum level can be estimated 
from this!” and is consistent with the quiescent level of the galaxy after the ~6-day 
fading event. This shows that the background level seen is not surprising for an 
early-type galaxy, and implies that the FRB afterglow had already faded below this 
level by the third ATCA epoch. 
FRB 110523. After the initial submission of this manuscript a study was published® 
announcing the discovery of FRB 110523, which we have compared and contrasted 
with FRB 150418 in the main text. There are now strong indications that there are 
two or more FRB progenitors, and we speculate that the observed pulse width 
may act as a useful discriminator between these. To this end we consider the time 
sampling and frequency resolution of the study that discovered FRB 110523, as 
well as its DM, in an effort to see whether the pulse was resolved in the Green Bank 
observations®. We would expect an unresolved FRB to have an observed width of 
no more than 1.08 ms, 1.14 ms and 1.26 ms in the highest, central and lowest fre- 
quency channels, respectively, in the Green Bank study’. This is inconsistent with 
the reported observed width (after the effects of scattering had been removed) at 
the 3c to 4c level, indicating that FRB 110523 appears to be resolved and therefore 
to have an intrinsic timescale of ~1 ms. 
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Extended Data Figure 1 | The photometric redshift of the FRB host spectral fits are shown, and these are denoted by the red and blue shading 
galaxy. A y’ fit of the redshift of the galaxy based on the spectral energy respectively. The spectral redshift is denoted by the dashed vertical 
(L,) distribution is shown. The photometric redshift determined from this _ line. The inset shows the spectral energy distribution fit, with the seven 
is 0.48 <z< 0.56 (68% confidence, denoted by the shaded regions). Two photometric estimates overplotted with 1c error bars. 
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Extended Data Figure 2 | The optical surface brightness profile of the angle (PA) are given in the inset. The model profiles and data are shown as 
FRB host galaxy. The surface brightness profile of the galaxy in the Subaru __ the flux along an ellipse as a function of semi-major axis. The image point 
i’ band image was fitted to an ellipsoidal Sersic function. Best-fit values spread function (PSF) profile is also shown as a function of radius. Error 


for the half-light radius (R,), Sersic index (n), axis ratio (b/a) and position bars give the root-mean-square scatter of the pixel counts along the axis. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 1 | Summary of follow-up observations of FRB 150418 


Time (UTC) Telescope Band Tobs (s) Mode  Level/limit 
2015-04-18-04:21:15 Parkes 1.4 GHz 561 TD? 2.2 Ty 

Shadowing MWA 185 MHz 27000 TD < 1050 Jy ms 
2015-04-18-04:31:08 Parkes 1.4 GHz 465 TD < 0.17(W/0.9 ms) °° Jy 


2015-04-18-05:04:35 Parkes 1.4 GHz 1181 TD < 0.17(W/0.9 ms)~°* Jy 
2015-04-18-06:30:15 ATCA 5.5,7.5GHz 19800 RI*  0.27(5) mJy/beam, 0.18(3) mJy/beam 
2015-04-18-07:46:27 Parkes 1.4 GHz 3618 TD < 0.17(W/0.9 ms)~°? Jy 
2015-04-18-08:47:28 Parkes 1.4 GHz 3618 TD < 0.17(W/0.9 ms)~° Jy 
2015-04-18-09:48:09 Parkes 1.4 GHz 3617 TD < 0.17(W/0.9 ms)~°* Jy 
2015-04-18-10:48:59 Parkes 1.4 GHz 3617 TD < 0.17(W/0.9 ms)~°* Jy 


2015-04-18-11:49:37 Parkes 1.4GHz 758 TD <0.17(W/0.9 ms)~°8 Jy 
2015-04-18-12:20:57 Swift X-ray 3976 PC < 7.1 x 10-4 erg/cm?/s 
2015-04-18-14:22:52 Lovell 1.4 GHz 7200 TD < 0.11(W/0.9 ms)~°° Jy 
2015-04-19-06:06:19 Subaru 7 600 Ph* —22.06(31) mag (AB) 
2015-04-19-06:37:12 Subaru r 900 Ph* 23.33(16) mag (AB) 
2015-04-20-05:50:27 Subaru id 1200 Ph* 22.08(31) mag (AB) 
2015-04-20-06:30:53 Subaru cr 1200 Ph* 23.59(16) mag (AB) 


2015-04-20-15:49:09  Effelsberg 1.4 GHz 8300 TD <0.11(W/0.9 ms)~°> Jy 
2015-04-21-06:40:42 Parkes 1.4 GHz 3600 TD <0.17(W/0.9 ms)~°? Jy 
2015-04-21-17:21:40 SRT 1.4 GHz 3600 TD <0.61(W/0.9 ms)~°> Jy 
2015-04-24-02:44:15 ATCA 5.5,7.5 GHz 72900 RI*  0.23(2) mJy/beam, < 0.08 mJy/beam 
2015-04-26-01:45:05 ATCA 5.5,7.5 GHz 74700  RI* 0.09(2) mJy/beam, < 0.08 mJy/beam 
2015-05-07-03:18:42 Swift X-ray 2908 PC  <9.3x 107 erg/cm?/s 
2015-05-18-12:30:00 GMRT 0.61GHz 7200 RI < 0.35 mJy/beam 
2015-05-22-12:42:00 GMRT 1.4 GHz 7140 ~—s RI < 0.15 mJy/beam 
2015-06-04-21:12:15 ATCA 5.5,7.5GHz 26700 RI* 0.11(2) mJy/beam, < 0.09 mJy/beam 
2015-10-15-05:32:23 Lovell 1.4 GHz 7200 TD  <0.14(W/0.9 ms)/? Jy ms 


2015-10-21-00:14:15 Keck OIR 3600 Sp* see Subaru observation 
2015-10-27-14:09:35 ATCA 5.5,7.5GHz 30600 RI*  0.09(2) mJy/beam, < 0.07 mJy/beam 
2015-10-31-11:15:54 P200 a 1080 Ph*  18.92(10) mag (Vega) 
2015-10-31-11:38:31 P200 H 450 Ph* = 17.55(25) mag (Vega) 
2015-10-31-11:51:15  P200 Kg 810 Ph* = 16.51(5) mag (Vega) 
2015-11-03-11:57:41 Subaru OIR 10800 Sp* see Figure 3 


2015-11-11-18:40:00 GMRT 1.4 GHz 22020 ~=s RI < 0.07 mJy/beam 


The start times, telescope name, observing band and observing duration are listed. In the fifth column time domain, radio imaging, photon counting, photometric and spectroscopic observations are 
denoted TD, RI, PC, Ph and Sp respectively, and detections (of either the FRB, fading transient or host galaxy) are denoted with an asterisk. The final column gives the detection level, or 30 upper limits 
in the case of non-detections. ‘Shadowing’ refers to when the MWA was observing simultaneously with the Parkes telescope. 
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Non-destructive state detection for quantum logic 
spectroscopy of molecular ions 


Fabian Wolf!, Yong Wan!+, Jan C. Heip!, Florian Gebert!, Chunyan Shi! & Piet O. Schmidt!? 


Precision laser spectroscopy! of cold and trapped molecular ions 
is a powerful tool in fundamental physics—used, for example, in 
determining fundamental constants’, testing for their possible 
variation in the laboratory*‘, and searching for a possible electric 
dipole moment of the electron®. However, the absence of cycling 
transitions in molecules poses a challenge for direct laser cooling 
of the ions®, and for controlling”"'' and detecting their quantum 
states. Previously used state-detection techniques based on 
photodissociation!” or chemical reactions!> are destructive and 
therefore inefficient, restricting the achievable resolution in 
laser spectroscopy. Here, we experimentally demonstrate non- 
destructive detection of the quantum state of a single trapped 
molecular ion through its strong Coulomb coupling to a well 
controlled, co-trapped atomic ion. An algorithm based on a state- 
dependent optical dipole force’ changes the internal state of the 
atom according to the internal state of the molecule. We show that 
individual quantum states in the molecular ion can be distinguished 
by the strength of their coupling to the optical dipole force. We also 
observe quantum jumps (induced by black-body radiation) between 
rotational states of a single molecular ion. Using the detuning 
dependence of the state-detection signal, we implement a variant 
of quantum logic spectroscopy'*”* of a molecular resonance. Our 
state-detection technique is relevant to a wide range of molecular 
ions, and could be applied to state-controlled quantum chemistry!” 
and to spectroscopic investigations of molecules that serve as probes 
for interstellar clouds'®*”. 

One of the salient features of trapped-ion systems is that the 
universal Coulomb interaction allows strong coupling of diverse 
quantum objects—such as different species of atomic ions, or of 
atomic and molecular ions. Being able to perform quantum logic 
operations (for example, in the form of gates'**°?!) between such 
quantum objects has proven useful for quantum information pro- 
cessing and quantum simulations. The use of quantum logic opera- 
tions in trapped-ion systems also allows the advantages of different 
atomic species to be combined. Quantum logic spectroscopy is 
one application, in which the high degree of control achieved over 
selected atomic ions is extended to species over which such con- 
trol is lacking'>'®. Here, we demonstrate quantum logic operations 
between a single molecular ion and a co-trapped atomic ion. Our 
findings make a wide range of molecular ions accessible to quan- 
tum logic operations. Our technique also allows single molecules 
to be investigated in a well isolated system without being disturbed 
by the environment (the limiting factor in other implementations 
of single-molecule spectroscopy, such as surface-enhanced Raman 
spectroscopy”). 

Quantum logic operations between atoms are based on quantum- 
state-dependent forces, often induced by laser fields. The same 
approach is applicable to molecular ions. Unlike with atomic ions, 
however, the laser coupling to the molecular ion is distributed over 
many rotational-vibrational transitions, which weakens the interaction 


strength for individual transitions (see Methods). The strength of the 
optical dipole force (ODF) on the molecular ion is inversely propor- 
tional to the detuning of the dipole laser from an optical transition in 
the ion. This transition frequency depends on the rotational state of the 
molecule. By measuring the strength of the dipole force, we can thus 
infer the rotational state. If the strength of this force is oscillating with a 
frequency close to one of the secular motional frequencies of a two-ion 
crystal (consisting of a molecular and an atomic ion), it is resonantly 
enhanced, generating coherent states of motion, which can be detected 
on the atomic ion*”>. 

Here, we extend this simple detection scheme. We detect the small 
ODF that arises from a laser field tuned near the X!D*+(J= 1) > 
A!'D*t (J=0) resonance (detuning Angu) in a *4MeH* molecular 
ion, in the presence of a background force (which is nearly inde- 
pendent of detuning) from a co-trapped Mg” atomic ion (Fig. 1c; 
X'D* and A'D* label the molecular energy levels, and the superscript 
‘Tl refers to the multiplicity of the electron spin). Black-body radiation 
induces transitions between rotational states (J), resulting in a thermal 
rotational-state population with a maximum at J =4 for temperatures 
near 300 K. We chose to probe the J= 1 state (which makes up around 
8% of the population of rotational states), because of its long average 
dwell time of around 3 seconds (see Extended Data Fig. 1). The vibra- 
tional population with vibrational quantum number v > 0 in *“MgH™ is 
negligible at room temperature, owing to the large vibrational spacing”. 
Applying the ODF resonant with either the in-phase or the out-of-phase 
mode implements a displacement operator that results in coherent 
states of motion?’, which are impractical for state discrimination 
because of their non-orthogonality. Instead, we map the molecule’s 
internal state information to an engineered motional two-level quan- 
tum system (motional qubit) that is coupled by the ODE. This approach 
allows us to suppress the background force from **Mg* and enables 
advanced coherent-population-transfer schemes, such as composite 
pulses”). 

The state-detection algorithm starts with both axial motional modes 
of the two-ion crystal (which consists of a *>Mg* and a **MgHt ion) 
initialized in the motional ground state |0) m= |0)1p |0) op via the atomic 
ion”’, (‘IP’ denotes ‘in-phase’; ‘OP’ denotes ‘out-of-phase.) Two long- 
lived hyperfine states in **Mg* serve as the atomic qubit denoted by 
||) mig» |T)mg (Fig. 1a), initialized to || )mg. The motional qubit is pre- 
pared in the state | |) =|1)1p |0) op by driving a x-pulse on the blue 
sideband transition (BSB) of the in-phase mode on the atomic ion while 
changing its internal state from | |)g to |1)mg. The ODF is imple- 
mented by applying two counter-propagating laser beams along the 
axial direction of the ion trap, forming an oscillating one-dimensional 
optical lattice. By adjusting the relative detuning, 6, between the two 
beams to the difference between the motional frequencies of the axial 
in-phase and out-of-phase modes, 6 = wop — wp = Ww yp(/3 —1),a 
Raman coupling involving only the motional states 
|) |7)m = 10) |1)op is implemented. This then results in Rabi flop- 
ping between the two composite states (Fig. 2 and Extended Data Fig. 2). 
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Figure 1 | Coupling strength of an optical dipole force to the atomic and 
molecular ions. a, Partial level scheme of the relevant atomic (left) and 
molecular states (right) and their laser coupling to the lattice laser. J refers 
to the rotational state; w is the laser frequency; *P3/2, 7Pi/2, *Sij2, A'S * and 
X'X* are atomic/molecular energy levels, with the superscripts denoting 
the multiplicity of the electron spin; Ayyg and Ayigrs represent the amount 
of detuning from atomic/molecular resonance; 6 represents the relative 
detuning of the two laser beams forming the optical lattice. The inset shows 
the molecular substructure for the states involved in our state-detection 
sequence. The number on the transition in the inset is the experimentally 
implemented geometric coupling coefficient. b, The two lattice beams form 
a moving interference pattern, leading to a temporally and spatially varying 
light force on both ions (yellow, *Mg*; blue, “MgH"). c, Rotational state 
(J, mj=0) and detuning-dependent coupling strength of the optical dipole 
force, interacting with a**MgH™ molecular ion on the P-branch (J J—1, 
solid line) and R-branch (J J+ 1, dashed line) of the X'D* —A!D* 
electronic transition. The blue dotted line denotes the coupling strength to 
the Mg" ion. 


The interaction between the lattice laser field and the motional qubit 
is described by the Hamiltonian (see Methods): 


H= RQ pNop(4wabp + a}paop) (1) 


where a‘, /dyp(op) is the creation/annihilation operator for the 
: IP(OP) : ‘ 

in-phase (or out-of-phase) mode, 77p(op) is the Lamb-Dicke parameter 
for the in-phase (or out-of-phase) mode”, h is Planck’s constant divided 
by 21; and (Qiis the effective Rabi frequency. It is given by: 


Deg = Quigyfl — 2R(Amgn)cos (5) + R(Amga)? (2) 


where R(Anig) = Qugu(Amgr)/ 2ug is the ratio between the Rabi 
frequencies of the Raman couplings in the molecular and atomic ions, 
depending on the detuning Ayigu. The distance between the ions deter- 
mines the relative phase, dy, between the forces on the ions". Ina 
Bloch-sphere picture, the effect of the ODF on the atom and the mol- 
ecule can be illustrated as the vector sum of two torques, 
Fopr= Fog + Fog acting on the motional qubit (Fig. 2b, c). In the 
experiment, the time, ta, in which the lattice laser interacts with the 
ions is adjusted such that for 2yigt = 0 (that is, the molecular ion is not 
in J=1) a 7-pulse is performed on the motional qubit (Fig. 2b). If the 
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Figure 2 | Mapping of the molecular state to the motional qubit. 

a, Circuit description of the mapping process. For ideally chosen 
parameters Aygy and 65y, the circuit is equivalent to a CNOT. The action 
of the optical dipole force (ODF) on the motional qubit is represented by 
the torque Fopg = Fag + Fegu- b, For J+ 1, only the atomic ion interacts 
with the lattice. The ODF creates a torque, Fyig, on the Bloch sphere 
(purple arrows). c, If J= 1, the additional force on the molecule (torque 
Fyigu, depicted in blue) is added to the force on the atom, leading to a 
modified population transfer. 


molecule is in the probed state (2ugt + 0), the additional force changes 
the population transfer (Fig. 2c). This step transfers the internal infor- 
mation of the molecular state to the motional qubit. The second BSB 
m-pulse maps the motional-state population (containing the molecule’s 
internal state information) to the atomic qubit. The population in state 
||) mg follows: 


Pie = |! + cos(x 1 — 2R(Amgu)cos(6y) + R(Amgn)* ) (3) 
and is detected using state-dependent fluorescence on the **Mg* ion, 
where | |) mg fluoresces and |) mg does not. The full sequence is shown 
in Extended Data Fig. 3. 

The calculated signal P),),,, is shown in Fig. 3a for different phases 
(dy) and different detuning Avg values. The maxima correspond 
to rotations of 27n on the Bloch sphere of the motional qubit, with 
n€N. In particular, at the outermost peaks (Fig. 3b), the forces from 
the atomic and molecular ions cancel or add up to 27 exactly, 
depending on the sign of Aygu. There is a trade-off in the choice of 
lattice-laser detuning from the molecular resonance. That is, if the 
detuning is too large, the ODF will not be detectably different for a 
molecule in the rotational state of interest compared with a molecule 
in any other state. For a detuning too small, spontaneous Raman scat- 
tering will change the rotational state before it can be detected. Under 
ideal experimental conditions, it is possible to choose a combination 
of the phase dy and the detuning Aig; to perform state detection in a 
single experimental repetition (cycle), because it is free from quantum 
projection noise (Pli)atg = lifthe molecule isin J=1, and Pug = 0 ifit 
is not). In this case, the experimental sequence can be understood in 
terms of a CNOT gate acting on the motional qubit with the molecular 
rotational qudit |J) MguH as a control (Fig. 2a): 
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Figure 3 | Non-destructive state detection. a, The theoretically calculated 
signal P\,),,, on the atomic ion (that is, the proportion of the population that 
is in the| | )Mg state), for **MgH* in the |J= 1, m;=0) grr state, and its 
dependence on the relative phase, dy, of the optical force between the atomic 
and molecular ion, and on the detuning, Agu, from molecular resonance. 
b, Cross-section of panel a, for the phase dy ~ 0.6, corresponding to a trap 
frequency of wp * 2.17 MHz. ¢, d, The height of the observed signals as a 
function of detuning for the regions indicated in b. The inset in c shows a 


Ja 1) Mex ee _ J# 1) Mgu|t)m 
J 1) mgtt|T)m > [J 1) Mgr| 1 )m 


=1) mgtt| |)m — J=1) Mgt] Lm 


=1)mgtt|T)m > J=1)Mgu|T)m 

A typical measurement with a positive detection event can be seen 
in the inset of Fig. 3c. If the molecular ion is not in the probed rota- 
tional state, the n-pulse on the motional qubit is successful and we 
observe low fluorescence, (P| Late © 0). After entering the J= 1 state, 
the additional force on the molecule will change the 7-time on the 
motional qubit and we observe fluorescence on the *Mg* ion 
(P\\) ag > 0), until the rotational state changes again. The signal corre- 
sponds to a quantum jump of the rotational state in the molecular 
ion, induced by black-body radiation. The signal background of 
P11) © 0-2 and reduced contrast arises from experimental imperfec- 
tions in implementing the logic gates (see Methods). Because the 
algorithm implements a quantum non-demolition measurement of 
the molecular state, we reduce quantum projection noise by averaging 
each data point over 30 experimental cycles with a duration of about 
9.3 milliseconds per cycle. We identified an experimental event as a 
positive detection event if at least three consecutive data points were 
significantly (1.50) above background (see Methods). In a total meas- 
uring time of approximately 23 hours, we observed 18 positive detec- 
tion signals of up to several seconds’ duration. The estimated average 
number of scattered photons from the optical lattice beams that inter- 
act with the molecular ion during a single detection cycle at 
Amgu = 25 GHz is around 0.04 (see Methods). If we assume that a 
single scattered photon removes the molecule from its rotational 
state, this results in an average single-cycle detection efficiency of 
96%, neglecting imperfections in any of the qubit and gate operations; 
this therefore represents an upper bound. Estimates of the expected 
number of events—calculated using this detection efficiency, the 
mean population and the required averaging time of about 1 second 


typical measurement result for a quantum jump (driven by black-body 
radiation) into and out of the J= 1 rotational state. The error bars (68% 
confidence interval) are dominated by quantum projection noise. The 
thickness of the theory bands (green areas in c, d) accounts for small changes 
in trap frequency (and thus in dy) for the different data points. The grey 
data points are excluded from the analysis in Fig. 4, because they could not 
be assigned to a specific slope of the theoretical prediction, thus preventing 
unambiguous inversion of equation (3). 


for a positive signal event—agree well with the experimental results 
(see Methods). 

The lattice light introduces an AC Stark shift in the molecule’s spec- 
tral lines of the order of several hundred kilohertz, which is much 
larger than all magnetic couplings in the molecular ion. Therefore, the 
substates are labelled by m, with the quantization axis along the electric- 
field direction of the lattice light. In this coordinate system, only 
n-polarized light is present, coupling only to states with m;=0 
(Fig. 1a inset). The resulting signal as a function of the detuning and 
the molecular quantum state is shown in Fig. 3c, d for 69 + 0.61, corre- 
sponding to anormal mode frequency of up * 2.17 MHz. Experimental 
data for dy ~0 are shown in Extended Data Fig. 4. 

The dependence of R(Awgu) on the detuning Amer from a molec- 
ular resonance allows us to perform spectroscopy, because forces 
arising from coupling to other rotational states are negligible over a 
large range of detunings (Fig. 1b). For this, we have combined data 
taken for different detuning Angry values and different phases (6) 
by inverting equation (3) numerically and accounting for the reduced 
contrast (Extended Data Fig. 2). The measured Rabi frequency ratios, 
R(Angu), for the X'=*(J= 1) + A'X*(J=0) transition are summarized 
in Fig. 4. Also shown in Fig. 4 is a fit to the wings of a simplified model, 
Ree(f) = A/|f—fo|, with the free parameters A and fo, where we assumed 
a detuning-independent background coupling to the far-detuned tran- 
sition in the Mgt ion (Fig. 1c). We determine the transition frequency 
to be fo = 1067.74789(40) THz (68% confidence interval), which is in 
agreement with a previous measurement of fo = 1067.7473(15) THz 
(ref. 26). The fitted characteristic detuning for the Rabi frequency ratio 
is A =45.98(73) GHz (68% confidence interval). 

We estimate that non-destructive detection of the internal state 
of a molecular ion with near 100% efficiency could take less than 
10 milliseconds, given a larger detuning Aygx of roughly 100 GHz and 
improvements in preparing and manipulating the co-trapped atomic 
ion (see Methods). This represents a major advance over previous, 
destructive detection techniques, which require frequent reloading. 
Our technique will realise its potential in high-precision spectroscopic 
investigations of narrow transitions between states in molecules using 
an independent spectroscopy laser. However, whereas in our present 
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Figure 4 | Quantum logic spectroscopy. The Rabi frequency ratio, 

R= Qygu(Amgr)/ mg, as a function of detuning Avigr from a 
molecular resonance, for all data for which equation (3) could be 
inverted unambiguously (see Fig. 3c, d). Measurement data recorded at 
wp & 2.17 MHz (6y % 0.67; nine points; see Fig. 3) and wyp 2.21 MHz 
(dy ¥ 0; seven points; see Extended Data Fig. 4) are indicated by blue 
circles and red diamonds, respectively. The solid line is a fit to the 
combined data set, resulting in fy = 1067.74789(40) THz. The error bars 
correspond to the errors shown in Fig. 3 and Extended Data Fig. 4 
(68% confidence intervals). 


work black-body radiation probabilistically populates the detected 
state, precision spectroscopy will require more-efficient state-prepa- 
ration schemes”*”’ or rotational—vibrational cooling techniques 7"!!. A 
combination of these tools will enable the realization of optical clocks 
based on molecular ions approaching fractional frequency uncertain- 
ties at the 1078 level?’—where the underlying clock transitions, or 
combinations of transitions, can be sensitive to variations in funda- 
mental constants’, to an electron electric dipole moment’, or to parity 
violation in chiral molecules. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Experimental set-up. The experiments are carried out in a linear Paul trap, witha 
typical axial trap frequency of 2.21 MHz for *Mg™. We use a hyperfine qubit con- 
sisting of the states ||) mg=|F=3, mp=3) and |})mg=|F=2, mp=2) of the 2S1/2 
electronic ground state (mr is the projection of the total spin, F, on the quantization 
axis). The Raman laser system for coherent manipulation of the *Mg* qubit via 
the °P3/ level is based on a frequency-quadrupled fibre laser*!. The detuning from 
the atomic resonance is fixed at 9.2 GHz, limiting ground-state cooling efficiency” 
and qubit operations owing to spontaneous Raman scattering**. The same laser is 
also used for internal state discrimination by applying an optical sideband tuned 
near resonance of the ||) mg /°P3/2, F=4, mp=4) component of the D2 transition. 
A similar laser system addressing the 7S} /. +P 1/2 (D1) transition is used for optical 
repumping and quenching during sideband cooling””. 

The light fields creating the optical lattice for molecular-state detection and 
spectroscopy are provided by a frequency-doubled dye laser whose frequency is 
tuned close to the X'=*+(J= 1) — A!S+(J=0) transition of “MgH*, with a variable 
detuning Ayigu typically smaller than ~100 GHz from the molecular transition, 
and detuned by Aygu © 1.5 THz from the atomic D1 transition of Met. Two 
counter-propagating laser beams derived from this laser are linearly polarized in 
the horizontal plane with a relative detuning 6. They are aligned along the trap axis 
to form an angle of 1/4, with the atomic quantization axis being given by a static 
applied magnetic field of ~0.58 mT. The detuning between the two laser fields 
results in a moving optical lattice, providing a temporally and spatially varying 
force on the two ions with a relative phase 6y, which depends on the distance 
between the two ions. Two optical fibres suitable for ultraviolet single-mode 
transmission® are used to reduce the effect of beam-pointing fluctuations on the 
coupling strength, and aid in the initial alignment of the beams onto the molec- 
ular ion. The frequency of the spectroscopy light is monitored by a commercial 
wavemeter (High Finesse WS/7). We assign an uncertainty of less than 100 MHz 
to the measured value. 

To reduce quantum projection noise resulting from imperfect single-qubit oper- 
ations, we average over 30 measurement cycles to get a single measurement point. 
This procedure takes 280 ms. If at least three consecutive points are significantly 
above the threshold (1.50), we assign them to a positive detection signal from the 
molecule. To determine the signal height, we take the first and last measurement 
points into account only if they are at least 20 above the background. This proce- 
dure improves the signal quality, because we remove data points that contain only 
a fraction of positive detection events during the 30 repetitions. 

Hamiltonian for lattice coupling. The Hamiltonian (H) for N ions coupled 
by a one-dimensional moving optical lattice in the interaction picture can be 
written as: 


N N 
H=)> AQ e "exp do in (ane H+ aye) +huc. 
j k 


where j labels the ions; k labels the axial motional modes with frequency wy; 1x is 
the Lamb-Dicke parameter; ah is the creation operator; h.c. is the hermitean con- 
jugate; yj; is a phase. 2) = §2);§22;/(2A) is the two-photon Rabi frequency for the 
two lattice beams coupling to an electronically excited state with detuning A. 
Considering the case of two ions j € {1, 2}, we have to take two axial motional 
modes into account, namely k € {IP, OP}, denoting the in-phase and out-of-phase 
motion of the two ions, respectively. In the following, the relative detuning of the 
two lattice beams, 6, is chosen to be 6=wop — wp. After the rotating wave approx- 
imation, only the second-order mixed term in the Lamb-Dicke approximation 
survives, resulting in the following interaction Hamiltonian, mediating the cou- 
pling between the two modes: 


aap AQ ein; Nj, op4PAOp +hic. 
j=l,2 


In adding up the forces on the two ions, we have to consider the different effect 
on both modes, and take into account the signs in the matrix for transformation 
between normal modes and laboratory frame**. We find for two ions with equal 
mass the Lamb-Dicke factors: 


wp = "No,1p = Np 
op = —2,0p = "op 
and the Hamiltonian reads: 
H= —Finypnopawabyp| Qe! — Dye!?2] +h.c. 


It is convenient to define: 
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P=, + Y, 


The Hamiltonian can then be written as: 
H=—e'®/2fin pnoparpadyp[ Qe®*/? — Qre-®#/?] + hic. 
We introduce the modified Rabi frequency: 
Q= OegeiPet = — ei? 2[ Qei¥/2 — Qye-id/2] 


with amplitude: 


Qefe = Qo,] 1 — 2Reos(6y) + R? 


where R= §2;/{2 is the ratio between the Rabi frequencies and ®,¢ is an effective 
phase. Identifying 2) = Quigu and (2; = Qug with the Raman Rabi frequencies 
for the molecular and atomic ions, respectively, we arrive at equation (2) of the 
main text. For very large detunings from either the atomic or the molecular ion, 
the effective Rabi frequency reduces to Qygy and yg, respectively. The final 
interaction Hamiltonian is then given by: 


H=hQermpnop(e™“arpabp + &ajpaop) (4) 


which is a Hamiltonian that describes a system consisting of n + 1 coupled levels, 
where n is the number of initial excitations. In the case n= 1, we get a two-level 
system consisting of the states | |) m=|1)1p |0)op, |t)m=|0) |1)op, which are cou- 
pled with the Rabi frequency ®.. The action of this Hamiltonian can be described 
as Rabi flopping between motional states. Results from an experimental imple- 
mentation are shown in Extended Data Fig. 2 for a two-ion crystal consisting of a 
?5Mg* ion anda molecular ion in a rotational state not addressed by the lattice. 
The reduced initial contrast and its decay are attributed to imperfect ground-state 
cooling, off-resonant scattering of the Raman beams that are interacting with the 
atom, and dephasing of the lattice beams. Starting from both axial modes near the 
ground state, a quantum of motion is added using a BSB 7-pulse. The lattice cou- 
pling is then switched on for a variable time and the motional information is 
mapped back onto the electronic qubit by a second BSB 1-pulse (see Extended 
Data Fig. 3). In such a sequence, we are not sensitive to the phase factors, etiPelt, 
which can therefore be neglected in the Hamiltonian, resulting in equation (1) of 
the main text. Note that an additional phase of 7x is introduced if the sign of the 
detuning differs for the atomic and molecular resonances, so the relative phase is 
given by: 


ondAk 
la 2n{dAk + 7] 


where d is the distance between ions and Ak= k, — k2 = 2k, is the difference 
between the wavenumbers of the two lattice beams (the top part of the equation 
relates to the same sign, and the bottom part to the opposite sign). 

Coupling of the optical lattice to the molecular ion. The coupling strength of the 
lattice laser to the molecular ion can be described by the transition matrix element 
between two states, |€”A"v"J"m}) and |é’A'v’J'm;), with A being the projection 
of the angular momentum on the molecular axis, which is A’ = A” =0 for both 
involved states, since we have a ) — » transition without electron angular momen- 
tum. The electronic part of the wavefunctions is given by €’= A and €”=X in 
spectroscopic notation. The vibrational and rotational quantum numbers are given 
by v and J, respectively, and m7 is the projection of J onto the quantization axis. 
Note that the quantization axis for the molecule is given by the electric field 
of the lattice light, since the AC Stark shift causes dominant-level splitting. The 
transition-dipole matrix element is usually approximated by: 


(EJ mM! dal 60" my A!) = 


fee EO, v") SEL, N'sy", A) seeomy!, m, J", my) 


where /ix, is the electronic transition dipole moment. The Franck-Condon factor 
describing the overlap of the wave functions of the two vibrational ground states 
can be calculated from the data given in ref. 36: 


2 
s®C(y! =0, v" =0) = [foe ob nant ~ 0.0919 
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The Honl-London factor describing the dependence of the transition strength on 
the rotational quantum numbers is given in ref. 37: 


SHL(y!, Al a 0; FE Au = 0) = max(J’, J") 


The geometric factor describing the addition of total angular momentum of the 
molecule and the angular momentum of the photon is given by: 


yi 1 y" 
se (my sJ"m/) = |e4 capil ” 
J m q m,; ii 


with the Wigner 3j-symbol and the polarization components 4. Because the quan- 
tization axis is given by the lattice, it is convenient to calculate the coupling in a 
basis parallel to the lattice electric field, where the polarization components are 
e l=el=Oande=1. 

The Rabi frequency from the interaction of the optical lattice with the molecu- 
nae ea (equation (4)) starting from a specific rotational state |£”"J""my ) is given 
by”: 


(€'v'J'mj|dg]€"v"J" m} ) P 


AmgH 


E\E ef 
OQmigh = ayy | 
2h? tg 


2 
—FiEr Mxa SFCSHLY J gecom 
2h? Amex a myq 


2 
—FiEa Mxa_ grcctitggeom 
2h? Amex 


where £, and E) are the electric-field amplitudes of the two counter-propagating 
laser beams forming the optical lattice, and Ayigt is the detuning from the molec- 
ular resonance. Using a similar approach for the atomic ion, we get a Rabi frequency 
ratio that is independent of the applied electric field. From the well-known transi- 
tion strengths in Mg” and the fitted value for A of 45.98(73) GHz (68% confidence 
interval), we extract a dipole matrix element for the molecular transition of 
HS? = 1.2510(99) ea, which is smaller than the matrix element em = 1.779ea9 
extracted from the theoretical data presented in ref. 36. We use the experimental 
dipole matrix element for the curves in Fig. 3 and Extended Data Fig. 4. 
Scattering rate and single-cycle detection efficiency. We assume that scattering 
of a single photon on the optical transition of the molecular ion will change its 
rotational-vibrational state, owing to the non-vanishing Franck-Condon factors 
between different vibrational states. Therefore, a small scattering rate is desired. 
In the following we use a simplified model in which the molecular ion is treated as 
a two-level system to estimate the order of magnitude for the spontaneous emis- 
sion rate of the molecular ion that is subject to the ODE. The number of scattered 
photons from a two-level system, with excited-state linewidth J, interacting for a 
time f, with an optical lattice formed by two laser beams of equal Rabi frequency 
Qand detuned by Ay from resonance is given by: 


_ 102? 
2A Mgt 


ty 


scene 


For the implemented quantum algorithm, the time ¢, is chosen such that the 
coupling to the atomic ion drives a 7-pulse between the two motional states, 
t= T/(Qugmpnop). Using R= Qygu/Qmg and Quigu= ?/ (2Amgt), we obtain: 


ae == 
AmgH™ pop 


where [9 for the molecular ion is given by: 


3 
Wo 2 

5S Oe 
3neoh, 3 XA 


with the scaled transition dipole moment /ix,q given above. For Poissonian photon 
distribution, the probability to scatter no photon follows: 


Prcatt(0) => eo Picts 


For a detuning of roughly 25 GHz, we expect to scatter on average 0.04 photons, 
which corresponds to a fundamentally limited single-cycle detection efficiency 
of Ess = Pscatt(0) 96%. For an increased detuning of — 100 GHz, the number of 
scattered photons on average would be reduced to 0.0026 for a 1-pulse with the 
optical lattice. However, to get a large enough signal contrast for single-cycle detec- 
tion, it would be necessary to perform a 21-pulse with the lattice, which leads toa 
single-cycle detection efficiency of 99.5%. 

The number of positive detection events, Ney for a total number Nota of trials 
can be estimated from: 


3N, 
Nev = (Ess) cycle PihermP1>3 Mtotal 


where Piherm is the probability of the molecule being in the probed state for a 
thermal distribution at 300K, and P;-; is the probability of the molecular ion 
staying at least for the time of three measurement points in the probed state, which 
is a prerequisite for positive detection in the experiment. In our case, we have 
Nootal © 2 X 10°, Piherm = 0.08, P;+3= 0.8 (see Monte Carlo simulation in Extended 
Data Fig. 1) and Neycte= 30. For a single-shot-detection efficiency of the order of 
Es. © 95%, this results in an expected number of points belonging to a detection 
signal of Ney of about 130, which is in agreement with the observed number of 
points N<\? = 100 assigned to the 18 positive detection events. 

Uncertainties of the fitted parameters. We extracted the centre frequency and 
the line strength from a weighted nonlinear least-squares fit. The assigned 
uncertainties correspond to the 68% confidence bounds, C, that have been cal- 
culated assuming a normal distribution of errors and according toC = b+ tS, 
where b is the fitted coefficient, t is the inverse of the Student’s cumulative 
distribution function for the required confidence, and $= (X'X)~1s2, with the 
mean squared error s” and the Jacobian X of the fitted values with respect to the 
coefficients. 
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Extended Data Figure 1 | Single trajectory from a Monte Carlo 
simulation of molecular dynamics. a, The ion initially prepared in the 
rotational ground state is transferred to higher rotational states because 
of coupling to black-body radiation at 300 K. b, The probability of finding 
the ion in a certain rotational state in the simulation (red bars) follows a 
thermal distribution. The blue bars are calculated values from a master 
equation approach. The deviation between the red and blue bars results 
from the finite time interval of the Monte Carlo wavefunction simulation. 
c, The dwell time decreases for higher rotational states. 
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Extended Data Figure 2 | Rabi flopping between motional states. 
Implementation of the sequence shown in Fig. 2 for Qyigu=0. The 
duration of the applied optical lattice is varied to induce Rabi flopping 
between the motional qubit states | |),,, and ||). The error bars show the 
95% confidence interval of the photon-distribution fit**. The red dashed 
line shows a fit to a damped oscillation. 
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Extended Data Figure 3 | Full experimental sequence. a, Circuit 
description of the sequence. i, A BSB 1-pulse initializes the atom in the 
state |)mg and the motional state in the qubit state | |) m. ii, The ODF 
rotates the motional qubit controlled by the internal state of the molecule 
(see Fig. 2). iii, A second BSB x-pulse maps the motional state (which 
contains the information about the molecule’ internal state) to the 
atomic qubit. iv, The atomic qubit’s state is read out by state-dependent 
fluorescence. b, Pictorial representation of the laser couplings in a 
simplified level scheme. 
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Extended Data Figure 4 | Raw data for dy ~ 0. a, Theoretically predicted 
signal for dy ~ 0, corresponding to wyp 2.21 MHz. b, Expansion of the 
region shown in a with the seven measured data points that are shown as 
red diamonds in Fig. 4. The error bars indicate the 68% confidence interval 
of the photon-distribution fit®. 
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Possible light-induced superconductivity in K3C¢o 


at high temperature 


M. Mitrano!, A. Cantaluppi!’, D. Nicoletti>?, S. Kaiser!, A. Perucchi’, S. Lupi*, P. Di Pietro*, D. Pontiroli®, M. Ricco, 


S. R. Clark®7, D. Jaksch’”’® & A. Cavalleri!2:” 


The non-equilibrium control of emergent phenomena in solids is 
an important research frontier, encompassing effects such as the 
optical enhancement of superconductivity!. Nonlinear excitation”? 
of certain phonons in bilayer copper oxides was recently shown to 
induce superconducting-like optical properties at temperatures 
far greater than the superconducting transition temperature, T- 
(refs 4-6). This effect was accompanied by the disruption of 
competing charge-density-wave correlations”’®, which explained 
some but not all of the experimental results. Here we report a similar 
phenomenon in a very different compound, K3Cgo. By exciting 
metallic K3C¢o with mid-infrared optical pulses, we induce a large 
increase in carrier mobility, accompanied by the opening of a gap 
in the optical conductivity. These same signatures are observed 
at equilibrium when cooling metallic K3C¢o below T, (20 kelvin). 
Although optical techniques alone cannot unequivocally identify 
non-equilibrium high-temperature superconductivity, we propose 
this as a possible explanation of our results. 

Molecular solids of chemical formula A3C¢o often crystallize in 
face-centred cubic structures (Fig. 1a) in which each Cgq? ion contrib- 
utes three half-filled molecular orbitals to form narrow bands?. These 
electronic states give rise to superconductivity at equilibrium'”!!, medi- 
ated by a combination of electronic correlations!*” and local molecular 
vibrations!*"!°, 

In the experiments reported here, femtosecond mid-infrared optical 
pulses were used to excite K3Cgo powders (see Methods and Extended 
Data Fig. 1 for sample characterization). The excitation (‘pump’) pulses 
were tuned to wavelengths between 6 um and 151m (200-80 meV pho- 
ton energy), a spectral region in which both the metallic plasma and 
local molecular vibrations could be excited. The resulting changes 
in terahertz (THz)-frequency reflectivity and optical conductiv- 
ity were measured with a second pulse (the ‘probe’), a quasi-single- 
cycle THz-field transient, which was detected after reflection from the 
photo-excited material (see Methods and Extended Data Figs 2, 3). 
These transient changes were determined at different pump-probe time 
delays and normalized to the absolute equilibrium optical properties 
measured on the same sample (Extended Data Fig. 4). 

Figure 1c displays two representative frequency-dependent equilib- 
rium reflectivity spectra R(w) measured above and below T- (20K). The 
corresponding real and imaginary part of the optical conductivity, 0 (w) 
and o2(w), as extracted by Kramers-Kronig transformations, are shown 
respectively in Fig. 1d and e. Similar to what has been reported for K3Cgo 
single crystals'”~’, in the metallic state (T'= 25 K) the optical conduc- 
tivity displayed a narrow Drude peak and a polaronic band”” centred 
at 55 meV. When cooling below T,, a saturated reflectivity (R= 1), an 
optical gap in o)(w) and a divergent o2(w) emerged from the metal. 

This equilibrium superconducting response is to be compared with 
that measured in metallic high-temperature K3Cgp after excitation 


at 7 1m wavelength (180 meV photon energy), which is tuned to the 
Tu(4) vibration (see Fig. 2 for measurements at 1 ps time delay). For 
all temperatures between T:= 20K and T= 100K, photo-excitation 
drove transient changes in the optical properties, which were very sim- 
ilar to those observed when cooling at equilibrium (see also Extended 
Data Fig. 5). Ata 1 ps pump-probe time delay, we observed a saturated 
reflectivity (R= 1; Fig. 2a and d), gapped o)(w) (Fig. 2b and e), and 
divergent 02(w) (Fig. 2c and f). 

Furthermore, all transient optical spectra could be fitted with the 
same function used for the low-temperature superconductor at equilib- 
rium. An extension of the Mattis-Bardeen model was used, applicable 
to superconductors of arbitrary purity and taking into account both 
an optical gap (a superconducting component) and a fluid of normal 
quasi-particles (a Drude component)’!. The non-equilibrium fits 
evidenced an 11 meV gap in the non-equilibrium o)(w), nearly twice as 
large as the 6 meV superconducting gap at equilibrium. 

Figure 3 reports similar measurements taken at higher temperatures, 
for which the effect progressively disappeared as temperature increased. 


Rio) 


) 
1,000 10 100 
Energy (meV) 


0 
1,000 10 100 
Energy (meV) 


10 100 
Energy (meV) 
Figure 1 | Structure and equilibrium optical properties of K3Cgo. 
a, Face-centred cubic (f.c.c.) unit cell of K3C¢o (ref. 26). Blue bonds link 
the C atoms in each C¢o molecule. K atoms are represented as red 
spheres. b, Ceo molecular distortion (red) along the T,,(4) vibrational 
mode coordinates. The equilibrium structure is displayed in blue. 
c-e, Equilibrium reflectivity at the sample-diamond interface R(w) (c) and 
complex optical conductivity (d, real, o)(w); e, imaginary, 02(w)) of K3C¢o 
measured at T= 25 K (red) and T'= 10K (blue). Hatched regions highlight 
changes across the superconducting transition. 
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Figure 2 | Transient optical response of photo-excited K3Cgo at T= 25K 
and T= 100K. Columns as Fig. 1; data taken at equilibrium (red) and 1 ps 
after photo-excitation (blue) at a pump fluence of 1.1 mJ cm~’, measured 


Only a partial reduction in o|(w) anda less pronounced divergence in 
o2(w) were observed. For these higher temperatures the Drude com- 
ponent alone was sufficient to fit the transient spectra, which did not 
exhibit a clear gap. However, we note that the a;(w) Drude peak in the 
light-induced phase (blue curves in Fig. 3b and e) was always narrower 
than for the equilibrium metal (red curves in Fig. 3b and e), underscor- 
ing an optical enhancement in carrier mobility for all temperatures. 
In Fig. 4, we plot the transient integrated loss in spectral weight 
(0.75-2.5 THz, that is, 3.1-10.3 meV) as a function of base temperature 
(Fig. 4a), pump-probe time delay (Fig. 4b), excitation fluence (Fig. 4c) 
and wavelength (Fig. 4d). The regions in which the optical proper- 
ties required the full Mattis—Bardeen fit are shaded in blue, whereas 


at temperatures T= 25 K (a-c) and T= 100K (d-f). Fits to the data 
are displayed as dashed lines. Hatched regions highlight pump-induced 
changes. 


those regions in which the fit yielded a gapless, Drude conductivity 
are shaded in white. 

Figure 4a displays a crossover between a light-induced state with 
a Mattis—Bardeen gap (blue), observed up to T= 100K, and a high- 
mobility metal (white) observed at higher temperatures. Similarly, 
in the time-delay dependence (Fig. 4b) a spectral-weight reduction 
was observed immediately after the excitation pulse (red curve), 
followed by a gap opening only at later time delays (blue shading). 
The gapped state disappeared after approximately 2 ps, and the rest of 
the decay involved relaxation of a high-mobility metal (white shad- 
ing) with an exponential time constant of ~10 ps (see Extended Data 
Fig. 6 for representative spectra as a function of pump-probe delay). 
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Figure 3 | Transient optical response of photo-excited K3Cgo at 
T= 200 K and T= 300K. Columns as Fig. 1; data taken at equilibrium 
(red) and 1 ps after photo-excitation (blue) with a pump fluence of 
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1.1 mJ cm’, measured at temperatures T= 200 K (a-c) and T= 300K (d-f). 
Fits to the data are displayed as dashed lines. Hatched regions highlight 
pump-induced changes. 
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Figure 4 | Scaling of the o,(w) gap with experimental parameters. 

In all panels, the photo-induced reduction in o\(w) spectral weight, 

—Aoj, integrated between 0.75 THz and 2.5 THz, is plotted on the 

y axis (represented by filled circles). a-d, The lost spectral weight is plotted 
as a function of temperature (a), pump-probe time delay (b), pump 
fluence (c) and pump wavelength (d). Error bars represent maximum 
uncertainties determined from different sets of measurements. The regions 
shaded in blue indicate the parameter ranges for which the transient 
response was fitted with a model for a superconductor. The red curve in b 
is the pump pulse profile (cross-correlation signal), while the dashed line 
is a double exponential fit (7; + 1 ps, 72 10 ps). Dashed vertical lines in d 
are the frequencies of the four T), vibrational modes. The experiments 
were not possible for pump wavelengths between 61m and 3 um (hatched 
region in d) owing to absorption in the diamond window that contained 
the K3Cgo powders. White and blue filled circles are data taken at fluence 
values of 0.4mJcm~? and 1.1 mJ cm ~’, respectively. Data in b-d were 
measured at T= 25K. 


The pump fluence dependence of Fig. 4c reveals a similar crossover. 
Metallic conductivities (white) were observed for low excitation levels 
(F<0.8 mJ cm~?), whereas fully gapped states (blue) emerged only at 
higher fluences (F> 0.8 mJ cm7”). 

Finally, when tuning the pump to short wavelengths (2 |1m, 620 meV 
photon energy) no response was found (Fig. 4d). On the contrary, 
a strong response was measured when the pump was made resonant 
with the T),(4) and T;,(3) modes, which correspond both to stretching 
and compressing of the hexagons and pentagons of the fullerene mole- 
cule (see Fig. 1b). A weaker response was still observed for wavelengths 
extending towards the T),(2) and T),(1) resonances. 

One possible interpretation for the data above posits that excitation 
of molecular vibrations can coherently stimulate a transition from the 
high-temperature metallic state into a non-equilibrium superconduct- 
ing phase. 

To first order, T1y vibrations do not couple to the fi, electronic states 
that cross the Fermi level, because T}, 4 tiy X tiy. However, for the 
strong optical fields used here (~1.5 MV cm~!), and from the polar- 
izability of the T|, molecular vibrations, we estimate large oscillatory 
distortions of the C-C bonds amounting to several per cent of their 
equilibrium bond lengths. Hence, the large amplitude excitation is 
expected to extend beyond a linear response, and to deform the struc- 
ture of the solid along other, anharmonically coupled coordinates. To 
lowest order, these couplings are described by qriu7Q terms in the non- 
linear lattice Hamiltonian®**”?, where qriy is the directly driven mode 
coordinate, and Q is the coordinate of any distortion contained in the 
irreducible representation of T)y x T1u. 
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As discussed in the Methods section, excitation of the T;, molecular 
mode is expected to drive a sizeable distortion along normal mode 
coordinates of H, symmetry. Because Hg modes are thought to assist 
pairing at equilibrium”, it is possible that a quasi-static Hy molecular 
deformation might favour stronger superconductivity, for example by 
causing an increase in the electron-phonon coupling (see Extended 
Data Fig. 7). 

Excitation of T), vibrations is also expected to modulate local 
electronic correlations. For a molecular mode of coordinate qrr with 
classical vibrational coordinate qja(T) = Csin({2irT), where C is a pro- 
portionality constant, {2;p is the driving frequency and 7 the time delay, 
we expect the charge density on each molecule to ‘slosh’ backwards and 
forwards”*. Deformation of the molecular orbital causes a modification 
of the Coulomb repulsion U, for the t), orbital (a= {x, y, z,}), which 
can be written as U,(7) =U, + AU, (1 — cos(2.Qjx7)). This highlights 
two effects: a modulation of the onsite Coulomb integral at twice the 
frequency of the driven molecular mode, and an average change in 
correlation energy, AU,, which have both been observed in a different 
organic compound”. 

Although a complete quantitative analysis requires a dynamical 
quantum chemistry calculation, an order-of-magnitude estimate for 
these changes in local electronic correlations can be extracted from 
frozen atomic motions for the T|4(4) mode. This estimate predicts a 
change in electronic repulsion AU, as high as ~10% of the equilibrium 
correlation energy for the orbitals orthogonal to the vibrational polar- 
ization (see Methods and Extended Data Fig. 8). Such a large asym- 
metric increase in U, for only one orbital unbalances the occupancy 
of the three tf), orbitals. This effect may interact with the dynamical 
Jahn-Teller coupling known to contribute to superconductivity near 
equilibrium’. 

However, the interpretation of the data proposed above is not unique. 
First, some other transient non-superconducting states would also be 
consistent with the optical properties revealed by our experiments. 
A large reduction in carrier scattering rate without condensation may 
be sufficient to explain these results. The measured gap in the transient 
conductivity would then simply emerge from the equilibrium polaronic 
band at 55 meV as the broad Drude absorption narrows. However, this 
scenario is difficult to reconcile with the estimated increase, rather than 
decrease, of the carrier effective mass (see Methods). Alternatively, one 
may also consider a sliding, non-commensurate charge density wave, 
which would be promoted by the enhanced electron-phonon coupling. 

Second, the mid-infrared pulses used to excite the sample couple not 
only to the vibrations but also to the metallic plasma. The measured 
photo-conductivity signal exhibits a peak in the region of the molecular 
vibrations (Fig. 4d), but this peak appears broadened compared to the 
mode linewidths. This might be an indication that a modulation of the 
local charge density also plays a role in driving the photo-induced state. 

Other observations, not mentioned above, appear important but are 
not well understood. The light-induced high-mobility phase is only 
induced for temperatures above, but not below, T,= 20K. Unlike in 
the case of light-enhanced superconductivity in the cuprates, in which 
excitation of a phonon below T- caused a stiffening of the Josephson 
plasma resonance’, in the present case we did not observe any widen- 
ing of the equilibrium superconducting gap at temperatures below T.. 
Rather, we measured very little perturbation at the same pump-probe 
delay (1 ps) for which the response above T- was maximum (Extended 
Data Fig. 9), followed by a filling in of the gap by hot quasi-particles at 
later time delays (Extended Data Fig. 10). This suggests that the equi- 
librium superconductor and the high-mobility transient state compete 
with one another. One may rationalize this observation by hypo- 
thesizing that equilibrium superconductivity depletes states near the 
Fermi level, which then cannot participate in the formation of the non- 
equilibrium state when stimulated by the light. 

On balance, although a definitive assignment should rest on further 
experimentation and theoretical analysis, we favour an interpretation 
based on stimulated superconductivity. However, the data above are 
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indicative of striking emergent physics away from equilibrium regard- 
less of the specific mechanism, and provide many new opportunities 
and challenges for both theory and experiments. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample growth and characterization. Stoichiometric amounts of finely ground 
Ceo powder and potassium metal were sealed in a cylindrical vessel and closed in a 
Pyrex vial under vacuum (~10~° torr). The potassium was kept separated from the 
fullerene powder during the thermal treatment, therefore only potassium vapour 
came in contact with Cgo. The two reagents were heated at 523 K for 72h and then 
at 623 K for 28h. The vessel was then opened and the recovered black powder was 
reground and pelletized. Afterwards, the pellets were further annealed at 623 K 
for 5 days. All described operations were performed in inert atmosphere (vacuum 
or Ar glove box with <0.1p.p.m. O2 and HO). The final product, K3Cgo, was 
characterized by laboratory powder X-ray diffraction and SQUID magnetometry 
(Extended Data Fig. 1). 

Equilibrium optical properties and fitting models. The K3Co equilibrium optical 
response was measured on compacted powder pellets (100-400 nm average grain 
size) sealed in a custom sample holder and pressed against a diamond window, 
ensuring an optically flat interface. 

Broadband infrared reflectivity measurements were carried out at the SISSI 
beamline (Elettra Synchrotron Facility, Trieste)?”. The sample was mounted on a 
cryostat coupled to a Bruker Vertex70 interferometer through a Hyperion micro- 
scope. The sample temperature was varied between 300 K and 25K. The spectra 
were referenced against the reflectivity of a gold mirror placed into the holder at 
the sample position. 

Data below 3 meV have been extrapolated with the Hagen—Rubens formula, 
while at high frequency the recalculated sample-diamond reflectivity from single 
crystal data was used!*, The complex optical conductivity was determined through 
a Kramers-Kronig algorithm for samples in contact with a transparent window”’. 

All spectra at T > T- are reported in Extended Data Fig. 4a. The low energy 
(S200 meV) part of the complex conductivity was fitted at all T > T- with a Drude 
term and a Lorentz oscillator, which reproduced the polaronic absorption centred 
at ~55 meV (ref. 17; a representative fit to the 25 K data is also displayed). 

The sample reflectivity in the superconducting state was determined as follows. 
We first measured the reflected electric field at different T< T. using THz time 
domain spectroscopy. Each spectrum was then referenced against the reflected 
field at 25 K, for which the broadband reflectivity was known from the synchrotron 
measurements. 

In Extended Data Fig. 4b the reflectivity measured at different T < T- is shown. 
The data were extrapolated below 3 meV using the Zimmermann model", a gen- 
eralization of the Mattis—-Bardeen optical conductivity for BCS superconductors 
with arbitrary purity. The optical conductivity was determined through the 
same Kramers-Kronig procedure described above. The real and imaginary 
conductivities are also reported in Extended Data Fig. 4b. 

The gap opening in o;(w) and the 1/w divergence in o2(w) observed upon 
cooling below T- are indicative of the superconducting transition. Through fits 
to the data performed with the Zimmermann model”, the optical gap 2A could 
be determined as a function of temperature. Both its zero temperature value 
2A(T=0) +6 meV and temperature dependence 2A(T)—closely following the 
mean field prediction—were found to be in full agreement with previously pub- 
lished data on K3Cgo single crystals!®, as shown in Extended Data Fig. 4b inset. 

In Extended Data Fig. 4c, d we show a direct comparison of the equilibrium 
optical properties (below and above T-) measured on our K3C¢o powders with those 
reported for single crystals'®. First, we observe that the polaronic absorption at 
~55 meV maintains the very same shape and central frequency. Second, the effect 
that the superconducting transition has on the low-frequency optical properties in 
our K3C¢9 powder is identical to that measured in the single crystals, which exhibit 
the same T. ~ 20K and the same optical gap 2A(T=0) +6 meV. 

The only difference between the single crystal data and the present measure- 
ments is in the carrier density and scattering rate. As is well known for this class of 
materials!’, in powdered samples both these quantities can be different from those 
of a single crystal. From Drude-Lorentz fits to the data, we could estimate a reduc- 
tion of a factor of ~4 in carrier density n (the Drude plasma frequency wp x Jn 
reduces from wen ~ 410 meV to lg ~~ 175 meV) and of a factor of ~4 in the 
scattering rate (from rrvstal ~ 18 meV to [Powder ~ 4 meV). 

Also the spectral weight ca of the 55 meV polaronic band is a factor of 
~4 higher in the single crystal (aa ~ 1,100 meV against (apeer = 540 meV). 
This implies that the quantity We / Qa extracted for our sample is very close to that 
reported in ref. 17, thus indicating that the relative ratio between mobile and 


localized carriers is the same in powders and single crystals. 

Determination of the transient optical properties. THz-frequency probe pulses, 
generated by optical rectification of 800 nm pulses in a ZnTe crystal, were used 
to measure the transient response of the sample after excitation. These probe 
pulses were detected after reflection from the sample-diamond interface in a 
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second ZnTe crystal, yielding pump-induced reflectivity changes for frequencies 
between 0.75 THz and 2.5 THz (3.1-10.3 meV) and with a pump-probe time-delay 
resolution of approximately 300 fs. 

The pump-induced change in the THz electric field AEg(t, 7) = ER’™?*“(t, r) — 
Ep(t) was acquired at each time delay 7 by filtering the electro-optic sampling 
signal with a lock-in amplifier, triggered by modulation of the mid-infrared pump 
with a mechanical chopper. This measurement yielded ‘pump on minus ‘pump 
off’ reflected electric field. 

The differential electric field AE g(t, T) and the stationary reflected electric field 
E(t) were independently Fourier transformed to obtain the complex-valued, 
frequency dependent AEg(w, 7) and Ep(w). 

Importantly, because the pump-induced changes were large (~2%), the same 
measurement was repeated by directly recording Ef“ (w, r)and Ep(w) without 
chopping the pump, and then calculating AEp(w, r) = ER mutt T) — Ep(w). 
This method does not require calibration of the absolute phase of the lock-in ampli- 
fier, and avoids phase errors in estimating the optical properties. The two methods 
yielded identical results. 

In Extended Data Fig. 2, Ep(t) and AEg(t, 7 > 0) are displayed both below and 
above T- with the corresponding normalized reflectivity changes. For all measured 
T> T- and for any 7 >0, all AEg(t, 7) and E(t) have the same phase, indicating 
that above T- one always finds an increase in reflectivity. On the other hand, all 
measurements below T- exhibit an opposite phase, that is, reflectivity is reduced 
upon excitation. 

The complex reflection coefficient of the photo-excited material, 7(w, 7), was 
determined using the relation 


_ ¥(w,T) — 7o(w) 


7o(w) 


AER(w, 7) 
Ep(w) 


To calculate these ratios, the stationary reflection coefficient 7(w) was extracted at 
all temperatures from the equilibrium optical properties, determined independently 
(at the same temperature and in the same sample holder) with synchrotron- 
based infrared spectroscopy. 

These ‘raw’ light-induced changes, which indicate increase and decrease of 
the reflectivity above and below T,, respectively, were reprocessed to take into 
account the penetration depth mismatch between THz probe (L 700-800 nm) 
and mid-infrared pump (d= 200 nm) (Extended Data Fig. 3a). Importantly, 
this renormalization only affects the size of the response, whereas the qualita- 
tive change in optical properties is independent of it and of the specific model 
chosen. 

As in, for example, ref. 1, this mismatch can be taken into account most simply 
by modelling the response of the system as that of a homogeneously photo-excited 
layer with the unperturbed bulk beneath it (Extended Data Fig. 3b). However, this 
model applies well only in case of large pump-probe penetration depth mismatches 
(d/L = 107-10°). 

A more precise method consists of treating the excited surface as a stack of thin 
layers with a homogeneous refractive index, and describing the excitation profile 
by an exponential decay* (see Extended Data Fig. 3c). By calculating the coupled 
Fresnel equations of such a multi-layer system, the refractive index at the surface, 
fi(w, 7), can be retrieved, and from this the complex conductivity for a volume that 
is homogeneously transformed: 


In Extended Data Fig. 3d-f we compare the transient optical properties obtained 
with the exponential decay model (same curves as in Fig. 2a—c) with those obtained 
with the single-layer description (having used the same pump penetration depth 
d=220nm). Both treatments yield very similar results, although we consider the 
exponential model more accurate. 

Note that the extracted optical properties from pump-probe data can sometimes 
be affected at the earliest time delays by perturbed free induction decay”’. This 
issue was raised recently in discussing transient THz probing of superconductors”. 
However, this effect occurs only when coherence is transiently induced on a time- 
scale far shorter than the momentum relaxation time and the response time of 
the detector*!. Here the excitation of the transient high-mobility state is achieved 
with 300 fs mid-infrared pulses, the rise time of the signal is 1 ps, and the relaxation 
occurs within 2-10 ps. Therefore these effects are negligible. 

Comparing transient conductivities below and above T,. Transient optical prop- 
erties above and below T- measured at the same time delay after photo-excitation 
are shown in Extended Data Fig. 9 (T= 1 ps, same above-T- data as in Fig. 2a—c). 

Importantly, although the equilibrium properties of the unperturbed solid are 
very different (normal and superconducting state, red and blue curves in Extended 
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Data Fig. 9a—c and d-f, respectively), the two light-induced states are quite similar 
to one another (light blue curves). In both cases, the state at T= 1 ps after excitation 
is gapped. 

As shown in Extended Data Fig. 2, these transient properties derive in one case 
(T> T.) from a light-induced increase in reflectivity (which saturates to R* 1 at 
w<2A), while in the other case (T < T.), from a decrease in reflectivity (partial 
gap filling). 

The relaxation dynamics of these two states towards equilibrium are shown in 
Extended Data Figs 6, 10. Above T., the gapped state measured at T= 1 ps rapidly 
relaxes into a state with a gapless optical conductivity. The metallic ground state 
is then recovered within ~5 ps. Below T,, after the early time response shown in 
Extended Data Fig. 9, at T= 3 ps the conductivity gap becomes filled. The super- 
conducting ground state is then recovered over relatively long timescales (>20 ps). 
Uncertainties in determining the transient optical properties. We examine here 
different sources of uncertainty and their propagation. The uncertainty in the abso- 
lute value of the measured equilibrium reflectivity is of the order of +1%. Extended 
Data Fig. 5a shows as coloured bands the propagated error bars in the equilibrium 
and transient optical properties for a +1% and a larger +2.5% uncertainty in R(w). 
In both cases the response remains gapped. 

We next examine uncertainty in the Fresnel phase coefficient /3, used in the 
Kramers-Kronig transformations. This has been set to = 217 meV to precisely 
match the central frequency of the polaronic band (55 meV) with that reported 
in previous optical studies!”* on the same compound. In Extended Data Fig. 5b 
we show how a +10% uncertainty in 3 would affect the equilibrium and transient 
optical response. 

Another possible source of error resides in the pump penetration depth value 
used to retrieve the transient reflectivities and optical conductivities. This value 
was set to d=220nm for all analysed data. In Extended Data Fig. 5c we show how 
a +25% change in d would affect the transient optical properties. 

Finally, in Extended Data Fig. 5d we show the effect of a different choice of 
functional form for the decay of7i(w, z) in the multi-layer model. The exponential 
decay used for all data analysed in the paper is compared with a single layer (already 
introduced above) and with a Gaussian-like decay, all having the same d value. 

In all these cases the impact of the different sources of error on the calculated 
reflectivities and optical conductivities is moderate and the qualitative behaviour 
of the spectra is not affected. 

Estimate of the carbon atom displacement. The peak electric field of the mid- 
infrared pulses can be estimated from 


_ F 
\| 2egcAt 


where c is the speed of light, eo is the vacuum permittivity and At is the pulse 
duration. The fluence F typically used in these experiments is 1 mJ cm~?, which, 
for a pulse duration of approximately 300 fs, corresponds to a peak electric field 
E=800kV cm". 

From previous optical measurements”, it is known that the real part of the 
optical conductivity on the pumped T},(4) phonon is 7;[T14(4)] =120Q7! 
while its central frequency is wp = 168.62 meV. The pump-induced polarization P 
(in units of Ccm~’) is then given by 


(Wo) 


Wo 


P=—“E=2.35 x 10° 


The polarization arises owing to a light-induced dipole moment P= ndZ.e, where 
nis the number of dipoles per unit volume, Zee is the effective charge defined as 
in ref. 34 and 6 is the atomic displacement. The Z. determined for this vibration 
is 0.147. Under the assumption that all the incident photons are contributing to 
the excitation, one can estimate from the polarizability a displacement 6= 0.24 A, 
that is, approximately 17% of the C-C bond length. 
Mode coupling and electronic structure calculations. Electronic structure, lattice 
dynamics, and nonlinear phonon couplings after excitation of the T),(4) phonon 
were obtained using density functional theory (DFT) calculations with a plane- 
wave basis set and projector augmented wave pseudopotentials (VASP software 
package)*°. These calculations were performed within the local density approxima- 
tion (LDA) with a 900 eV cut-off for the plane-wave basis set. A 6 x 6 x 6 k-point 
grid and a 0.1 eV Gaussian smearing were used for the Brillouin-zone integration 
during the self-consistency cycles. Lattice parameters and atomic positions were 
obtained by energy minimization, ignoring the Cg orientational disorder. The 
calculated lattice constant is 13.888 A, in good agreement with the experimental 
value of 14.240 A. 

Phonon frequencies and eigenvectors were determined using the frozen- 
phonon approach (as implemented in the PHONOPY software package)**. After 
the normal modes were identified, total energy curves as a function of the Raman 


mode amplitude Q were calculated while simultaneously keeping the amplitude 
of the pumped T},(4) mode finite. A shift of the Raman mode energy minimum 
for finite amplitude of the T,,(4) vibration allowed to identify a cubic q71y7Q cou- 
pling between the pumped infrared mode and the Raman vibration (see Extended 
Data Fig. 7a). 

Electron—-phonon calculations were performed using DFT-LDA (as imple- 
mented in the Quantum ESPRESSO software package)*” with 42 Ry and 420 Ry 
cut-offs for the plane-wave basis set and the charge-density expansions, respec- 
tively. A 6 x 6 x 6 k-point grid and a 0.020 Ry Gaussian smearing were used for 
Brillouin-zone integration during the self-consistency cycles. 

The majority of Raman modes couple to the T),(4) mode. In Extended Data 
Fig. 7a we show the shift in the Hg(1) mode minimum when the amplitude of the 
Tyu(4) mode is 2.0A Vamu. Such a shift implies that the structure is displaced 
along the H,(1) coordinate while the T|,(4) mode is pumped. We also report the 
band structure, electronic density of states (Extended Data Fig. 7b), and electron- 
phonon coupling (Extended Data Fig. 7c) for a structure displaced along the H,(1) 
coordinate with an amplitude of 1.5A Vamu. We find that such displacement 
slightly lifts the degeneracy of the f), states. However, these bands remain part of 
a single manifold, crossing each other at various points along the high-symmetry 
path of the Brillouin zone. 

We also find a modest change in the density of states, which is caused by the 
shift of a van Hove singularity near I‘ as the bands are moved by the Hg(1) mode 
displacement. Furthermore, the bandwidth of the t;, manifold steadily increases as 
a function of Hg(1) mode amplitude. This is reasonable, since the H,(1) displace- 
ments elongate the Cg) molecule and reduce the intermolecular distances, thus 
increasing the intermolecular hopping. 

The H,(1) displaced structure develops strong coupling between electrons and 
intermolecular phonon modes (Extended Data Fig. 7c). At equilibrium the three 
intermolecular modes with 5 meV energy do not couple to the f), electrons, while 
under displacement along the H,(1) coordinate two of these modes exhibit very 
large electron-phonon coupling. 

The electron-phonon coupling, obtained by integrating the Eliashberg func- 
tion a?F(w), is 0.52 at equilibrium and 1.44 for the H,(1) displaced structure. 
Energy-resolved changes in the electron-phonon coupling function, evaluated as 
Aa?F(w) = 07 F(w)Hg(1) — 0° F(W) equilibrium» are reported in Extended Data Fig. 7d. 
The increased coupling at low energy and a strong reshaping at high frequency 
(>130 meV) are apparent. 

Time-dependent onsite correlation energies. We estimated the magnitude of 
changes to the inter- and intra-orbital Coulomb repulsion of the t,, orbitals on a single 
Céo molecule when the T},(4) vibrational mode is driven. The relevant Cy molecular 
orbitals are described by px orbitals projecting radially outwards at each C atom. A 
tight-binding Hiickel model captures the hybridization of these orbitals in the Cgo 
cage. Hopping amplitudes are assumed to vary with the distance d between C atoms as 


V(d) = (1) 


@ 
where A =5.63 eV A. We take the C-C and C=C bond lengths as dy= 1.45 A and 
dp = 1.37 A, corresponding to Vs =2.69 eV and Vp = 3.00eV hopping integrals for 
the pentagonal-hexagonal and hexagonal-hexagonal edges of the Ceo cage. The x, y 
and Z ty, orbitals are the degenerate 31st to 33rd single electron eigenstates of the 
Hiickel model and the lowest unoccupied molecular orbitals of free Ceo (ref. 38). 
In Extended Data Fig. 8a, the wavefunction (chosen to be real) for each ty, orbital 
is plotted over the buckyball with the size of the spheres denoting the magnitude 
and the colour denoting the sign. 

C atom motions corresponding to the T),(4) vibrational mode were taken 
from a complete nearest-neighbour force field model calculation for Co, opti- 
mized to fit the optically accessible frequencies of the molecule’. We applied a 
‘frozen-phonon approach, which assumes the Born—Oppenheimer approximation 
to be strictly obeyed and that the molecular orbitals adiabatically follow the atomic 
distortions. Although this model is not appropriate for high-frequency vibrations 
and low-energy electronic properties, we posit that this approach can provide the 
order of magnitude of the response. The ft), orbitals were then computed for T1,(4) 
distorted configurations of Ceo using the Hiickel model with hopping integrals 
varied according to equation (1). Snapshots of the driven f),(z) orbital are shown 
in Extended Data Fig. 8b. 

The Hubbard Us on each Cgp site are estimated from 


Bn [er —Pevelra) _ 2 
A) n fer ad y) 


where p(r) is the charge density corresponding to a fy, orbital. 
This bare U will be reduced due to screening from valence electrons, intra- 
molecular correlations and metallic background in a solid. However, since we 
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are concerned with relative changes in U this renormalization is not crucial. We 
estimated F° by assuming point charges on each C atom and calculating their 
mutual interactions as 


P,Pje” 


Payee 
tay ATE esreO|M — 12] 


+ Lp Fe (3) 

1 
where p; is the probability given by a ty, orbital of an electron being on the ith C 
atom, and Fe = 12 eV is the atomic on-site repulsion*”. The variation of U with 
the vibrational driving was then obtained using the p; values of the driven thy 
orbitals and is shown, together with the single-particle energies of the orbitals 
over a vibration period, in Extended Data Fig. 8c, d. The amplitude of the vibra- 
tional driving A denotes the maximum displacement of any C atom. For A=5 pm, 
both quantities in the y, z t), orbitals vary by nearly 10% at twice the frequency of 
the vibration. 

Since the vibration is polarized along the x axis, the y, z orbitals remain degen- 
erate and possess identical U values. The modulation in y, z is significantly larger 
than that in x because these orbitals have the equatorial distribution running across 
the vibrational axis, causing their charge distribution to compress at the poles of 
the x axis (Extended Data Fig. 8b). 

The modulation magnitudes as a function of vibrational driving amplitude are 
shown in Extended Data Fig. 8e, f. An amplitude A =5 pm is consistent with the 
strong vibrational driving considered in this work. 

Sample size. No statistical methods were used to predetermine sample size. 
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AEx(t, T), measured at T= 3 ps (b) and at T= 1 ps (d). Data are shown both 
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Extended Data Figure 3 | Models for penetration depth mismatch. 

a, Schematics of pump-probe penetration depth mismatch. b, c, Also 
shown are the single-layer model (b) and the multi-layer model with 
exponential decay (c) (see Methods) used to calculate the pump-induced 
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changes in the complex refractive index, i(w). d-f, Reflectivity, R(w), and 
complex optical conductivity, o(w), of K3Cgo at T= 1 ps pump-probe delay 
and T=25K, extracted using the single-layer model (light blue) and the 
multi-layer model with exponential decay (dark blue). 
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Extended Data Figure 4 | Equilibrium optical properties. a, Reflectivity 
(left column), and real and imaginary parts (respectively middle and 
right columns) of the optical conductivity of K3C¢o displayed at different 
temperatures above T.. Dashed lines are fits to the 25 K data performed 
with a Drude-Lorentz model. b, Same quantities displayed at different 

T <T.. In the inset, the temperature dependent optical gap (filled circles) 
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is compared with previously published data on K3C,p single crystals 
(open circles!*). c, Optical properties of compressed powders of K3C¢o 
from a and b shown at representative temperatures below and above T.. 
The R(w) has been recalculated at the sample-vacuum interface. d, Same 
quantities as in c, measured on single crystals'*. 
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at T= 25K. Data have been measured with a pump fluence of ~1 mJ cm~” 
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Extended Data Figure 7 | Mode coupling and electronic structure a’F(w). In b and ¢, blue lines are for the equilibrium structure and red 
calculations. a, Calculated total energy curves as a function of Hg(1) mode _ lines are for the structure displaced along the H,(1) coordinate with an 
amplitude (Quay) when the amplitude of the T,,(4) mode is 0.0 (blue) and —_ amplitude of 1.5 A Vamu. d, Differential changes in the electron-phonon 
2.0A Vamu (red). b, Calculated band structure and electronic density coupling function, evaluated from the curves in c. 

of states N(E) of K3C¢o. ¢, Calculated electron-phonon coupling function, 
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Extended Data Figure 8 | Changes in time of single particle energy and 
Coulomb repulsion. a, Depiction of the x, y, z tj, orbital wavefunctions 
of Ceo according to the Hiickel model (left, middle and right column 
respectively, see Methods). Sphere sizes denote the magnitude of the 
wavefunction and colours indicate the sign. b, Snapshots of the calculated 
tiy(z) orbital at various points in the T),,(4) vibration polarized along the 

x axis. Colours and sizes follow those of a. c, Changes in the single-particle 
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energies, ¢;, of the ft, orbitals over one period of the T),(4) vibration with 
an amplitude A = 5 pm. d, Relative changes in the intra-orbital Coulomb 
repulsions, U;/U;(0). e, Maximum relative changes in the single-particle 
energies of the ty orbitals, ¢;/¢;(0), as a function the driving amplitude, A. 
f, Relative changes in the intra-orbital Coulomb repulsions, U;/U;(0), 
under the same driving conditions. 
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Extended Data Figure 9 | Response at early time delays below and above T,. Reflectivity, R(w), and complex optical conductivity, o(w), of K3Cgo at 
equilibrium (red, dark blue) and 1 ps after photo-excitation (light blue), measured at T > T. (a-c) and T < T. (d-f). Data were taken using pump fluences 
of ~1 mJ cm~? in a-c and ~0.5 mJ cm~? in d-f. Hatched areas highlight pump-induced changes. 
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Extended Data Figure 10 | Relaxation dynamics at T < T.. Columns as Extended Data Fig. 9; data were measured at equilibrium (grey) and after 
photo-excitation (red) at T= 10K, with a pump fluence of ~0.5 mJ cm~”. Data are shown at selected pump-probe time delays: 3 ps (top row), 
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A pentanuclear iron catalyst designed for water 


oxidation 
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Although the oxidation of water is efficiently catalysed by the 
oxygen-evolving complex in photosystem II (refs 1 and 2), it 
remains one of the main bottlenecks when aiming for synthetic 
chemical fuel production powered by sunlight or electricity. 
Consequently, the development of active and stable water oxidation 
catalysts is crucial, with heterogeneous systems** considered more 
suitable for practical use and their homogeneous counterparts more 
suitable for targeted, molecular-level design guided by mechanistic 
understanding” ’’. Research into the mechanism of water oxidation 
has resulted in a range of synthetic molecular catalysts, yet there 
remains much interest in systems that use abundant, inexpensive 
and environmentally benign metals such as iron (the most abundant 
transition metal in the Earth’s crust and found in natural*®”! and 
synthetic” oxidation catalysts). Water oxidation catalysts based 
on mononuclear iron complexes have been explored”!”!©!8, but 
they often deactivate rapidly and exhibit relatively low activities. 
Here we report a pentanuclear iron complex that efficiently and 
robustly catalyses water oxidation with a turnover frequency of 
1,900 per second, which is about three orders of magnitude larger 
than that of other iron-based catalysts. Electrochemical analysis 
confirms the redox flexibility of the system, characterized by six 
different oxidation states between Fe!!; and Fe!"!;; the Fe! state 
is active for oxidizing water. Quantum chemistry calculations 
indicate that the presence of adjacent active sites facilitates O-O 
bond formation with a reaction barrier of less than ten kilocalories 
per mole. Although the need for a high overpotential and the 
inability to operate in water-rich solutions limit the practicality 
of the present system, our findings clearly indicate that efficient 
water oxidation catalysts based on iron complexes can be created by 
ensuring that the system has redox flexibility and contains adjacent 
water-activation sites. 

We assumed that for iron-based molecular complexes to effectively 
catalyse water oxidation (2H,O — O,+4H* +4e’ ), they should have 
multinuclear structures and contain two water-activation sites sepa- 
rated by a distance appropriate for promoting intramolecular O-O 
bond formation. A multinuclear core typically gives rise to redox 
flexibility that is expected to be advantageous when reactions involve 
multi-electron transfer (Fig. 1a), as illustrated by multinuclear clus- 
ters acting as active centres in enzymes catalysing such reactions (for 
example, the Mn,Ca cluster in photosystem II for the four-electron 
oxidation of water’, and the Fe7Mo cluster in nitrogenase for the six- 
electron reduction of nitrogen”’). The benefit of the catalyst containing 
two suitably placed activation sites (Fig. 1b) is due to kinetics, in that 
intermolecular O-O bond formation through association of two mol- 
ecules is often so slow that it can be the rate-determining step!!!4!>"!7 


in the water oxidation reaction—and a catalyst design that ensures 
intramolecular O-O bond formation avoids such a slow association 
step. Note that dinuclear iron cores are known to serve as active sites 
for O-O bond formation and/or scission”*”>. 

Implementing this strategy, we employed a pentanuclear iron 
complex with open coordination sites, [Fel Fe! (y13-O)(1-L)6]3* 
(1; LH = 3,5-bis(2-pyridyl)pyrazole; Fig. 1c). 1 has quasi-D3 symmetry 
and consists of a [Fe3(13-O)] core wrapped by two [Fe(|1-L)3] units. The 
two iron ions at the apical positions are hexa-coordinated by three L” to 
form [Fe(1-L)3] units with distorted octahedral geometries, whereas 
the three iron ions in the triangular [Fe3(|13-O)] core are penta- 
coordinated by two L~ and an O*~ anion with distorted trigonal 
bipyramidal geometries. These penta-coordinated, or coordinatively 
unsaturated, iron ions in the triangular core are expected to serve as 
the active sites for O-O bond formation. 

We synthesized complex 1 by mixing stoichiometric amounts of 
Fe(SO,4)2-7H20 and LH in the presence of base in MeOH at room 
temperature under aerobic conditions”. Single crystal X-ray struc- 
tural analysis, elemental analysis, electrospray-ionization time-of-flight 
mass spectrometry and °’Fe Méssbauer spectroscopy revealed that one 
iron ion in the triangular core of 1 is oxidized to Fe'", leading to the 
isolation of the Fe!,Fe!" oxidation state (for experimental details, see 
Supplementary Information sections 1, 3 and 4). 

The cyclic voltammogram (CV) of 1 (0.2 mM) in an acetonitrile 
solution containing EtsNClO, (0.1 M) at a scan rate of 10 mV st 
(Fig. 2a) revealed one reversible reduction wave at E1;2=—0.55 V 
versus ferrocene/ferrocenium (Fc/Fc*) and four reversible oxidation 
waves at E}/2 = 0.13, 0.30, 0.68 and 1.08 V, assigned to five sequentially 
occurring one-electron redox reactions involving in turn each of the 
iron ion Fe!!!/Re!! couples. Thus, 1 possesses high redox flexibility, with 
six different redox states that are accessible under electrochemical 
conditions. The open-circuit potential of 1 was determined to be 
—0.26 V, indicating a Fe!! Fe"! state in solution and the solid state (see 
above). Addition of water dramatically changed the profile of the CV of 1 
(Fig. 2b): large irreversible anodic currents appeared at the potential 
corresponding to the Fe!”’,/Fe!/Fe!!, redox couple, suggesting electro- 
catalytic O2 evolution from water. 

To quantify the evolution of O2 from water, controlled potential 
electrolysis was conducted in a two-compartment cell separated by 
an anion-exchange membrane (Supplementary Fig. 3). A solution of 
1 (0.2 mM) in an acetonitrile/water (10:1) mixed solution containing 
Et4NClOg (0.1 M) was electrolysed at an indium tin oxide (ITO) 
electrode with a cell operating potential of 1.42 V versus Fc/Fc*, and 
the evolved O, was quantified by gas chromatography. With a total 
amount of passed charge of 41.0 C over a period of 120 min (Fig. 2c) 
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Figure 1 | Design of molecular catalysts for water oxidation. a, Redox 
flexibility arising from a multinuclear core. Multi-electron transfer to 
afford several oxidation states and electron rearrangement among valence 
tautomers enable the accumulation of positive charges required for water 
oxidation. b, Adjacent water-activation sites to promote intramolecular 
O-O bond formation. c, Ball-and-stick representations of the molecular 


leading to 2.5 ml of evolved Oy, the Faradaic efficiency is 96% based 
ona4e™ process. We also note the nearly linear nature of the water 
oxidation curve, indicating that the steady-state intermediate of the 
catalyst remained stable in solution throughout the catalytic cycles. 
Electrochemical testing of the stability of 1 during catalysis revealed 
that both the wave shape of each Fe!""/Fe!! process and the catalytic 
current for water oxidation were maintained over 100 scans of the CVs 
for 9h (see Supplementary Fig. 4) and after controlled potential elec- 
trolysis for 1h (see Supplementary Fig. 5). There was no evidence for 
formation of insoluble material on the electrode after the controlled 
potential electrolysis (Supplementary Figs 6-8). In the absence of 1, 
catalysis was not observed (black line in Fig. 2c). Taken together, these 
observations strongly indicate that 1 works as a robust homogeneous 
water oxidation catalyst. 

Figure 3 illustrates a possible catalytic cycle for the generation of O2 
mediated by 1, informed by electrochemical measurements revealing 
a four-step, one-electron oxidation that converts the initial Fe! Fell 


~ ya / N 
\ N HN—N N= 
[Fe" Fe" (1,-O)(u-L)_]* (1) LH 


@ Oxygen @ Hexa-coordinated Fe e Penta-coordinated Fe 


structure (left) and the Fe;O core structure (right) of [Fe",Fe!!(\13-O) 
({t-L)o]?* (1); the chemical structure of LH is also shown (bottom right). 
Three penta-coordinated iron centres are bridged by an oxygen atom in 
\t3-fashion to form a triangle structure, and two hexa-coordinated iron 
centres are connected to the triangle structure by six Ls. 


state (So in Fig. 3) to the Fe!!!; state (S4). Note that water attack on the 
Fe!,Fe!!! (So), Fel4;Fe!!4, ($1), Fel Fel; (S2) and FeFe!!, states ($3) 
can be excluded because their oxidation potentials were unchanged 
upon water addition (Fig. 2b). No change in the onset potential of 
water oxidation was seen in CVs performed over a range of solution 
pH (Supplementary Fig. 9), indicating that water attack on the S4 
state does not involve proton-coupled electron transfer. Therefore, 
subsequent insertion of a water molecule into the S4 state results in 
the formation of the Fe!”';(OH)) species with a hexa-coordinated Fell! 
ion in the triangular core (A in Fig. 3). Quantum chemical calcula- 
tions of the system using density functional theory (for details, see 
Supplementary Information sections 9.1 and 9.2) gave an estimated 
activation barrier for the water attack of about 15 kcal mol! rela- 
tive to the S4 state (see Supplementary Videos 1-4 for illustrations 
of insertion pathways). Coordination of another water molecule, 
together with sequential or simultaneous deprotonation, results in 
the formation of the bis-oxo species that calculations suggest to be at 
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Figure 2 | Evaluation of electrochemical properties and catalytic 
activity of 1. a, CVs of 1 (0.2 mM) in an acetonitrile solution with 
EtsNClO, (0.1 M) swept to negative (left) and positive (right) potentials 
from the open circuit potential at a scan rate of 10mVs_'. Red vertical 
dotted lines represent the redox potentials of five reversible redox waves. 
The oxidation states of iron ions are indicated by the red and blue coloured 
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labels. b, CVs of 1 (0.2 mM) in acetonitrile solutions with EtyNClO4 

(0.1 M) at a scan rate of 10mVs~! with 5 M of HO (red line) and without 
water (black line, the same data as in Fig. 2a). c, Electrolysis data in 
acetonitrile/water (10:1) mixed solutions with EtsNC1O, (0.1 M) ata 
potential of 1.42 V versus Fc/Fc* with 0.2 mM of 1 (red line) and without 
the catalyst (black line). 
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Figure 3 | Mechanism of O, evolution from water catalysed by 1. 
Bottom, one water molecule reacts with the S4 state formed by the 
four-electron oxidation of the Sg state (via S;, $2, $3). Further reaction 
with an additional water molecule (S4— A) and the deprotonation of the 


an energy minimum in the mixed-valence Fe"',Fe!(Fe'Y =O), state 
(B in Fig. 3). Note that intramolecular electron transfer from the 
hexa-coordinated Fe!” ions in the triangular core to iron ions at the 
apical position results in the formation of Fe!’=O units at the tri- 
angular core. The O-O bond formation can proceed between the 
co-facial oxo groups in B to afford the peroxo species (C in Fig. 3) 
without spin rearrangement and with an estimated reaction barrier of 
less than 10 kcal mol! (for details, see Supplementary Information 
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Figure 4 | Determination of the TOF value for electrocatalytic O2 
evolution. a, CV of 1 (0.2 mM) in an acetonitrile/water (10:1) mixed 
solution with EtyNCIO, (0.1 M) at scan rates of 0.2-2.6 Vs. b, A plot 
of the ratio of icat to ip as a function of the inverse of the square root of 
the scan rate v, showing the linear relationship. Data obtained from the 
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coordinated water molecules afford the adjacent two metal-oxo moieties 
in one molecule (B), which promotes rapid intramolecular O-O bond 
formation (B — C) to evolve O2 (C — Sp). Fe", orange; Fell, green; and 
Fe!Y red. 


sections 9.1-9.3). Liberation of O2 from C forms the So state to close 
the catalytic cycle. 

To evaluate catalytic activity of 1 in terms of turnover frequency 
(TOF) under electrochemical conditions, we used equation (1) below. 
We note that when the electron transfer rate of the catalyst and the 
concentration of the substrate are sufficiently high and when the cat- 
alytic current is limited only by the chemical reaction in solution, that 
is, under purely kinetic conditions!*””-”’, the equation can be used to 
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measurements with scan rates (in Vs!) of 1.60, 1.80, 2.00, 2.20, 2.40 and 
2.60 were used for the calculation of the TOF. The TOF value was estimated 
from the slope of the plot as 1,900s~'. The red line is forced to go through 
the origin. Each data point represents a single measurement. For the 
numerical data and further details, see also Supplementary Table 13. 


0.8 1.0 


25 FEBRUARY 2016 | VOL 530 | NATURE | 467 
Limited. All rights reserved 


LETTER 


calculate the pseudo-first-order rate constant k,,, which corresponds 
to the TOF value for O2 evolution (for details, see Supplementary 
Information section 10.1). 


1 
ae 
fot = 2.240 AT Py 2 (1) 
ip F 


In this equation, ica is the catalytic current, ip is the peak current meas- 
ured in the absence of the substrate, 1, is the number of electrons 
involved in the catalytic reaction, F is Faraday’s constant, kat is the 
pseudo-first-order rate constant, R is the universal gas constant, T is 
the temperature in kelvin, and v is the scan rate. As shown in Fig. 4a 
and Supplementary Fig. 28, constant limiting currents were observed 
when the scan rates were in the range 1.6-2.6 Vs_'. A plot of the ratio 
Of icat to ip as a function of the inverse of the square root of the scan rate 
shows a linear relationship (Fig. 4b), and gives a TOF estimate of 
1,900 s~! (for details of the calculation of the TOF value, see 
Supplementary Information sections 10.2-10.5). An independent 
though rough estimate can also be derived from the amount of O» 
evolved in the controlled potential electrolysis (Fig. 2c). With the 
assumption that the thickness of the reaction-diffusion layer adjacent 
to the electrode surface (jz) is of the order of 1 ./D/k cat (ref. 30), we 
obtain a TOE value of about 140-1,400s~! and a TON (turnover num- 
ber) of about 10°-10’ for 120 min. These TOF and TON values indicate 
that 1 is an efficient and robust molecular catalyst for water 
oxidation. 

Taken together, our experimental and computational results indicate 
the effectiveness of our catalyst design strategy, where redox flexibility 
of the pentanuclear iron complex enables the accumulation of positive 
charges (Sp—S, in Fig. 3) and intramolecular electron rearrangement 
(A — B) to form a reactive high-valent oxo species, and where adjacent 
active sites ensure formation of cofacial oxo units that can be converted 
to peroxo species via intramolecular O-O bond formation with a low 
activation barrier and without spin-rearrangement (B — C). Given the 
high TOF value achieved with 1, which is about 1,000 times greater 
than that of other reported iron-based catalysts®, we expect that our 
strategy will prove more generally effective for creating from abun- 
dant metals efficient and robust molecular catalysts for water oxidation. 
Regarding the present system, it should be mentioned that the high 
reaction rate was achieved in an acetonitrile/water mixture and using a 
large overpotential of more than 0.5 V (see Supplementary Fig. 9), each 
of which prevents practical use. One of the next goals is therefore the 
design of a water oxidation catalyst operating similarly efficiently and 
robustly but with a low overpotential and in water. 
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carbon sink 


Galen A. McKinley!*?, Darren J. Pilcher!*, Amanda R. Fay’, Keith Lindsay, Matthew C. Long & Nicole S. Lovenduski® 


The ocean has absorbed 41 per cent of all anthropogenic carbon 
emitted as a result of fossil fuel burning and cement manufacture’”. 
The magnitude and the large-scale distribution of the ocean carbon 
sink is well quantified for recent decades*“. In contrast, temporal 
changes in the oceanic carbon sink remain poorly understood’. 
It has proved difficult to distinguish between air-to-sea carbon 
flux trends that are due to anthropogenic climate change and those 
due to internal climate variability>®* 3. Here we use a modelling 
approach that allows for this separation", revealing how the ocean 
carbon sink may be expected to change throughout this century 
in different oceanic regions. Our findings suggest that, owing to 
large internal climate variability, it is unlikely that changes in the 
rate of anthropogenic carbon uptake can be directly observed in 
most oceanic regions at present, but that this may become possible 
between 2020 and 2050 in some regions. 

Recent observationally based syntheses have quantified mean ocean 
carbon uptake and its spatial distribution?*!5 (Extended Data Fig. 1). 
In addition, interior ocean observations analysed under the assumption 
of constant ocean circulation suggest a steady increase in the integrated 
sink over the last century’"'®. Yet surface observations clearly indicate 
that carbon uptake is strongly affected by variability in surface climate 
and ocean circulation®*’. This variability impedes our ability to 
develop a detailed, regional picture of how the ocean carbon sink is 
changing in response to increasing atmospheric partial pressure of 
carbon dioxide (p,o,) and the associated climate change. Although 
climate models suggest that the ocean should be a net sink for 
anthropogenic carbon for at least the next several centuries, they also 
suggest that climate warming and circulation changes will act to reduce 
the sink’s magnitude”'®. Monitoring current and future effects from 
the combined impact of increasing atmospheric p,o, and climate 
change—that is, the forced trend—on ocean carbon uptake presents a 
major observational challenge due to the strong influence of the vari- 
ability inherent to the climate system'*!7"'8, 

Previous modelling studies have attempted to separate internal 
variability from forced trends in ocean carbon uptake using several 
approaches. Variability in air-to-sea carbon fluxes has been linked to 
modes of climate variability in realistic hindcast models!*-?!, However, 
anthropogenic change can project onto these modes, leading to an 
incomplete separation. The ocean’s response to increasing atmospheric 
CO) in the absence of variability and change has been studied’, but this 
approach ignores both mean impacts on ocean circulation from varia- 
ble climate and indirect impacts on the carbon sink due to circulation 
change. Collections of Earth System Models have been used to assess 
relationships between natural variability and carbon cycle trends”, 
but diverse model structures—for example, the spatial resolution of 
the atmosphere and ocean components, parameterization of the lower 
food web, or numerical schemes—can influence resulting trends”*. 
Structural uncertainty precludes clear identification of the influence 
of internal variability”. 


We make use here of a large ensemble of a single Earth system model, 
the Community Earth System Model-Large Ensemble (CESM-LE, 
ref. 25) to assess variability and change in the ocean carbon cycle in 
recent decades and through 2100. CESM is a comprehensive coupled 
climate model consisting of atmosphere, ocean, land surface, and sea 
ice components. The CESM-LE experiment includes 32 members with 
ocean biogeochemistry output. The experiment included a control inte- 
gration of >2,000 years. A transient integration (ensemble member 1) 
started at year 402 of the control and was integrated for 251 years under 
historical forcing (1850-2005) and then the Intergovernmental Panel 
on Climate Change Representative Concentration Pathway (IPCC 
RCP) 8.5 scenario for 2006-2100. Additional ensemble members were 
initialized from ensemble member 1 at 1 January 1920, with round-off 
level perturbations applied to the air temperature field. See Methods 
for more details. 

Here, the use of a single model eliminates structural differences 
inherent to multi-model ensembles”, allowing the spread across the 
ensemble to be wholly attributed to the internal variability of the mod- 
elled climate system !1718:26, For each ensemble member, temporal 
trends in any variable can be separated into two parts: (1) the forced 
trend that is common across all ensembles, and (2) the unforced, or 
internal trend, that occurs only in that ensemble member. The spread 
of trends across the ensemble indicates how much internal variability 
causes individual ensemble members to deviate from the forced 
trend'*+!”'86. The forced trend, as its name suggests, is due to the 
model forcing, here including anthropogenic greenhouse gases and 
aerosols, as well as natural forcings (for example, solar variability and 
volcanoes) during the historical period'*”». In the case of the ocean 
carbon sink, there are two components to the forced trend. The first is 
the direct influence of increasing atmospheric p.,, driving continued 
ocean carbon uptake. The second is the indirect effect of changing 
climate that influences the physical state of the ocean and modulates 
air-to-sea carbon fluxes. 

Comparisons to observations illustrate that CESM captures the 
dominant modes and magnitudes of ocean carbon cycle variability and 
trends at regional to global scales (ref. 21, Methods, Extended Data 
Tables 1 and 2). To further validate the simulated mean CO, flux, we 
compare to fluxes estimated from observations’, and to a multi-model 
ensemble of 12 Coupled Model Intercomparison Project 5 (CMIP5) 
Earth system models (Fig. 1, Extended Data Fig. 1, Extended Data 
Table 3). For the 30-year mean CO) flux, CESM-LE is consistent with 
the observed estimates in most regions and for the global average, with 
small differences across the individual ensemble members (Fig. 1). 
In contrast, there is a substantial spread in CO) flux estimates from 
CMIP5”>. It is to be expected that structural differences between models 
would dominate differences in the multi-decadal mean CO, flux, since 
the long averaging period integrates over the timescales of the domi- 
nant modes of variability. This is exactly what we find; a much greater 
spread in mean CO) flux for CMIP5 than for CESM-LE for all biomes 
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Figure 1 | Modelled and observed mean 1982-2011 CO) flux in 15 ocean 
biomes. The CESM-LE mean is shown as a grey cross, with max/min in 
grey (N= 32), aligned with horizontal lines for each biome. Coloured 

dots represent CMIP5 models (N= 12), corresponding to the biome 
colours on the inset mean biome map (ICE, dark blue; SPSS, light blue; 
STSS, green; STPS, yellow; EQ, orange and red; full names are given in 


(Fig. 1). These structural differences across CMIP5 also affect CO; flux 
trends (Methods, Extended Data Fig. 2), indicating that a clean sepa- 
ration of forced trends from trends driven by internal variability is not 
possible with the CMIP5 multi-model ensemble, as it is with CESM-LE. 

Our model analysis considers forced trends and the spread of inter- 
nal trends in the ocean carbon sink across timeframes from decadal 
to centennial, all starting in 1990. Forced trends are shown only if they 
can be distinguished from trends due to internal variability with 95% 
confidence! (Methods). 

For the decade starting in 1990, internal variability is large enough to 
preclude identification of forced trends in the carbon sink across most 


a 1990-1999 b 


1990-2019 


CESM-LE mean 
CO, flux trend 


CESM-LE trend 
standard deviations 


Figure 2 | Forced trends and internal variability of CESM-LE trends in 
sea-to-air CO, flux. Forced trends for a, 1990-1999, b, 1990-2019 and 
c, 1990-2089. Light-grey ocean areas show where the forced trend cannot 
be identified with 95% confidence (Methods). CO, flux trend standard 
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Extended Data Table 4). There is insufficient data to plot the Northern 
Hemisphere ICE biomes”. The Atlantic Ocean is offset by -3molCm~“*yr~!, 
and the Southern and Indian oceans by +3 molCm “yr. The global mean 
is shown in the bottom right inset (scale is twice that of the main panel); the 
uncertainty on the observed value (grey band) is 0.12 molCm yr (ref. 28; 
P. Landschiitzer, personal communication). 


of the ocean (Fig. 2a). Internally driven variability in trends (Fig. 2d) 
is largest in the equatorial Pacific due to El Nifio/Southern Oscillation 
effects, and in regions of strong seasonal and interannual climate vari- 
ability, such as the high latitudes of the North Pacific and Atlantic and 
north of seasonal sea ice in the Southern Ocean*8!1b13222728 Only 
in the subpolar North Atlantic, equatorial Atlantic and in some loca- 
tions in the Southern Ocean are forced trends large enough to emerge 
from the variability over this period, and in these locations CO, uptake 
increases (Fig. 2a). 

Asa result of anthropogenic CO) emissions from 1990-2019, the 
ocean carbon sink increases in most locations outside the subtropics 
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deviations, indicating the impact of internal variability on CO; flux trends, 
for d, 1990-1999, e, 1990-2019 and f, 1990-2089. Negative values indicate 
increasing ocean carbon uptake. 
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(Fig. 2b). In isolated regions within the subtropics, the forced trend in 
carbon uptake for 1990-2019 is not large enough to be identifiable at 
the 95% level, despite the fact that internally driven variability is sub- 
stantially reduced relative to the decadal timeframe (Fig. 2d, e). Over 
100 years (1990-2089), anthropogenic forcing leads to strong increases 
in uptake in the high latitudes, and to reduced outgassing in the equa- 
torial Pacific and the eastern upwelling zones off South America and 
Africa. In the Pacific and Indian subtropics, the forced trend illustrates 
weakened carbon uptake by 2100 (Fig. 2c). Internal variability has 
minimal impact on 100-year CO) flux trends (Fig. 2f). 

The ocean’s capacity to absorb increasing amounts of anthropogenic 
CO, is not uniformly distributed. Across multi-decadal to centennial 
timescales, CO> flux does not change or decreases in the subtropical 
gyres (Fig. 2b, c). This is consistent with a convergent large-scale cir- 
culation and strong stratification that isolates the surface from the 
deep ocean's large capacity to hold carbon. Long-term warming also 
reduces CO? solubility!°'*!®. In contrast, the regions where ocean 
carbon uptake strongly increases are those with strong exchange 
between the surface and the deep ocean. In the equatorial Pacific, in 
eastern boundary zones, and in the Southern Ocean, upwelling deep 
waters have been out of contact with the atmosphere for hundreds of 
years and thus hold little, if any, anthropogenic carbon. As time pro- 
gresses, upwelling waters encounter an ever-higher atmospheric p.o,, 
which diminishes outgassing of natural carbon*””8 (Extended Data 
Fig. 1). In the North Atlantic, the direction of the exchange with the 
deep is reversed, with surface waters being transformed into deep 
waters by rapid buoyancy loss and deep convection. During this trans- 
formation, these waters increasingly absorb more carbon as atmos- 
pheric poo, rises. There is a large-scale correspondence of the regions 
of mean carbon uptake at present? with the regions where carbon 
uptake is predicted to grow most rapidly in the twenty-first century. 

The ability to separate forced from internal trends in CESM-LE 
(Fig. 2) allows for an assessment of timescales over which observations 
would be required in order to detect anthropogenically driven change 
in ocean carbon uptake from observations (Fig. 3). Consistent with 
previous studies*®??, detectability is assessed using ‘time of emergence, 
which is the year in which the signal of the forced trend would emerge 
from the noise of the internal variability. This analysis assumes obser- 
vations began in 1990 (Methods). 

The forced trend emerges early (by 2010) in some of the Southern 
Ocean and the Atlantic, where there is a large short-term change in 
the sink. Given the strong internal variability and the smaller forced 
trend in the equatorial Pacific, time of emergence is generally interme- 
diate here (by 2030-2050). The latest emergence occurs in the Pacific 
and Indian subtropical regions (2050+). Where the net effect of the 
forcing is to drive long-term steady carbon uptake, no change should 
be detected before 2100 (white areas in Fig. 3). If internal variability 
were to be substantially underestimated or overestimated at a location, 
estimates of time of emergence would be too short or too long, respec- 
tively. However, comparison with data indicates that CESM-LE captures 
carbon cycle variability reasonably well (Extended Data Tables 1 and 2). 

Using our current observational system for surface ocean carbon, 
should we be able to detect these predicted changes? At seven ocean 
time series stations, direct measurements of the ocean carbon 
cycle have been made at quarterly to monthly intervals for one to 
several decades? (Fig. 3). In the Atlantic, these locations are situated 
such that if observations had occurred since 1990 at a frequency 
sufficient to constrain the annual mean flux, they should be able 
to reveal change in the ocean carbon sink as distinct from internal 
variability at present (Irminger Sea, by 2015) or in the near future 
(BATS, ESTOC, CARIACO, by 2020; Iceland Sea, by 2040) (Fig. 3). 
However, for the Pacific sites, detection of change in carbon uptake 
should not be expected until at least 2050 (HOT, by 2050; Munida, 
beyond 2100). Unfortunately, at the time series site where CESM-LE 
suggests the forced trend may be first detectable (Irminger Sea), the 
Pco, data set is short (1983-2005) and highly variable’, making it 
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Figure 3 | Time of emergence for sea-to-air CO, flux. Time of emergence 
is when the forced trend becomes detectable given the internal variability 
(Methods). Blue stars indicate seven ocean timeseries stations’, from 
North to South in the Atlantic: (1) Iceland Sea, (2) Irminger Sea, (3) 
Bermuda Atlantic Time-series Study (BATS), (4) European Station for 
Time series in the Ocean at the Canary Islands (ESTOC), (5) Carbon 
Retention In A Colored Ocean (CARIACO) and from North to South in 
the Pacific, (6) Hawaii Ocean Time-series (HOT) and (7) Munida. Mean 
times of emergence for each biome are presented in Extended Data Table 4. 


impossible to determine if a trend towards increasing carbon flux is, 
in fact, occurring. 

Surface ocean carbon data from volunteer commercial and scientific 
ships are presently too sparse for direct estimation of multi-decadal 
carbon cycle trends in most regions**!*!, However, in the subtropics 
of the North Atlantic and Pacific, there are sufficient data to indicate a 
steady ocean carbon sink, and in the equatorial Atlantic to indicate an 
increasing sink for 1981-2009". These changes are consistent with the 
30-year forced signals expected from CESM-LE (Fig. 2b). More data, 
from all sources, will be required to determine whether these signals 
are, in fact, illustrating the forced trend in ocean carbon uptake!?, 

Going forward, ocean carbon monitoring efforts can benefit from 
this new ability to separate internal variability from forced trends. 
Long-term records can be interpreted in the context of the expected 
forced change in the ocean carbon sink; monitoring can be targeted 
to regions where the largest forced changes are expected; and regional 
aggregation approaches that optimally seek the forced signal can be 
developed. Concurrently, expansion of these analyses to large ensem- 
bles of other Earth system models!*”* will further elucidate the mecha- 
nisms, magnitudes, and timescales of forced trends in the ocean carbon 
sink. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


The Large Ensemble of the Community Earth System Model. The Community 
Earth System Model (CESM) is a comprehensive coupled climate model consisting 
of atmosphere, ocean, land surface, and sea ice component models”. The ocean 
physical model is the ocean component of the Community Climate System Model 
version 4°! The model has nominal 1° horizontal resolution and 60 vertical levels. 
Mesoscale eddy transport, diapycnal mixing, and mixed layer restratification by 
submesoscale eddies are parameterized with state-of-the-art approaches. The bio- 
geochemical-ecosystem ocean model includes multi-nutrient co-limitation on 
phytoplankton growth and specific phytoplankton functional groups as well as 
full-depth ocean carbonate system thermodynamics, sea-to-air CO) fluxes, and a 
dynamic iron cycle*’. The biogeochemical-ecosystem model compares favourably 
to observations, though there are some important biases, including weak Southern 
Ocean CO; uptake”). 

The CESM-LE began with a multi-century 1850 control simulation with con- 
stant pre-industrial forcing; the ocean physical state was initialized from observa- 
tions, ocean biogeochemical tracers were initialized from a separate 600-year 
spin-up, and other component models were initialized from previous CESM1 
simulations. Once the control simulation climate achieved quasi-equilibrium with 
the 1850 forcing, the first ensemble member was initialized from a randomly 
selected year in the 1850 control run: 1 January, model year 402. Ensemble 
member 1 was integrated forward from 1850 to 2100. The remaining ensemble 
members were integrated from 1920 to 2100 using slightly different initial condi- 
tions: Ensemble member 2 used one-day lagged ocean initial conditions, while 
spread in the remaining ensemble members was generated by round-off level per- 
turbations to their initial air temperature fields”®. After initial condition memory 
was lost, each ensemble member evolved independently. A total of 38 ensemble 
members were generated in this fashion, but 6 of these had corrupted ocean bio- 
geochemical output due to a setup error and affected fields were discarded. All 
ensemble members have the same specified external forcing: historical forcing 
from 1920 to 2005, and RCP8.5 forcing from 2006 to 2100. Differences from 
observed atmospheric p.,, for RCP8.5 for the 2006-2014 period are minimal”. 
Since atmospheric CO; concentrations are prescribed, CESM-LE ocean carbon 
fluxes do not feed back on the modelled climate. 

Analysis methods. We consider the linear trend at each model gridcell of annual 
mean CO) flux, in units of moles of carbon per square metre per year squared. 
The trend for CO flux is calculated for each ensemble member. The forced trend 
is the average trend across the 32 ensembles. Each ensemble member's unforced 
(internal) trend, due to internal variability, is the difference between that ensem- 
ble’s trend and the forced trend. The 95% confidence level for identification of the 
forced trend is calculated, for each grid cell and time frame, based on the number 
of ensembles required to resolve the ensemble mean response: Nin = 8/(X/o)?, 
where X is the forced trend and a is the standard deviation of trends". If Nmin 
exceeds the number of ensembles in CESM-LE (Nensembles = 32), the forced trend 
cannot be identified with 95% confidence. Time of emergence is the first year in 
which the signal-to-noise ratio exceeds a threshold value of 2, where the signal is 
the forced trend and the noise is the ensemble standard deviation”’. For efficiency 
of computation and presentation, signal-to-noise ratios are calculated at 5-year 
intervals (that is, 1990-1995, 1990-2000, 1990-2005, and so on). The signal- 
to-noise ratio must remain greater than 2 for all subsequent years. 

Model comparisons to observations. To assess the representation of internal var- 
iability in CESM-LE, Extended Data Table 1 compares CESM-LE modelled to 
observed variability in annual mean p,,, and CO; flux for 1982-2011, and 
Extended Data Table 2 compares trends over the same period. poo, data are from 
the Surface Ocean CO) Atlas (SOCATv2)*? averaged to monthly means at 1° x 1° 
resolution. CESM-LE members are each sampled in po, to reflect the data density 
available in SOCATv2. A background mean climatology! is removed at 1° x 1° 
resolution in order to address the potential of spatial aliasing when averaging to 
biome-scale!?*+>, An area-weighted average is then used to arrive at biome-scale 
annual means, and the 30-year trend is removed before calculating the standard 
deviation. For the CO, flux, we utilize monthly 1° x 1° resolution flux estimates 
that have full spatial and temporal coverage over the period 1982-20117. These 
estimates are based on the same Poor data set (SOCATv2). With the full global 
coverage of the CO; flux product, there is no need to sample or to remove a back- 
ground climatology from CESM-LE before biome averaging. Otherwise, the same 
processing is employed as for pcg,. The uncertainty reported in Extended 
Data Table 1 is one standard deviation of the variability represented by the 32 
CESM-LE members for each variable. There is insufficient data to make an inde- 
pendent uncertainty estimate with respect to variability from the observations. In 
Extended Data Table 2, linear trends in observed annual mean p Gos and CESM-LE 
Pco» sampled in the same way as these observations, are compared. Sampling in 
the same way as the observations allows for a direct model-to-observation 
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comparison in spite of the fact that the sparse data coverage may lead to inaccurate 
observed estimates of annual mean p_,, for some biomes in some years. Since the 
CO) flux product offers full coverage in space and time, there is no need for 
sampling prior to the calculation of trends. 

Within the uncertainty, modelled Poor variance is correct in seven of the 
biomes, underestimated in five biomes and overestimated in three biomes 
(Extended Data Table 1). However, in two of the three biomes where Poor 
variability is overestimated by the model (SO STSS, SO SPSS), comparison to the 
CO), flux product suggests that the model underestimates variability. In the third 
(NP STPS), the flux product comparison indicates that the model appropriately 
simulates variability. Conversely, in the biomes where p,, variability is underes- 
timated, the CO flux product comparison indicates either variability consistent 
with observations (NA STSS, EQ Atl), too high (NP STSS), or too low (NA SPSS, 
SA STPS). Similarly, in the biomes where p,,, variability is consistent with the 
observations, the CO) flux comparison indicates overestimation by the model (East 
EQ Pac, West EQ Pac, IND STPS), underestimation (NP SPSS, SO ICE), or con- 
sistency (SP STPS). 

Modelled trends in Poor and CO) flux (Extended Data Table 2) are largely 
consistent with observed trends, given the uncertainty. In one biome (West EQ 
Pac), the trend in Poon in the model is overestimated, though in this biome the 
CO, flux trend is consistent with the observed estimates. In three biomes (NP 
STPS, East EQ Pac, IND STPS), the flux trend is too large, and in one (SO SPSS), 
it is too small. However, in all four of these biomes, the p COs trends are consistent 
with the observed estimates. There is no clear relationship between over- and 
underestimation of trends and over- and underestimation of variability (Extended 
Data Table 1). 

In the CESM-LE, po, variability and trends dominantly control CO) flux var- 
iability and trends”). Thus, the fact that these comparisons for p,,, and CO) flux 
variability and trends differ substantially suggests that there is additional, unquan- 
tified uncertainty driven by the sparse sampling for pcg, and assumptions made 
in the development of the flux product”®. That CESM- LE falls clearly within the 
range of observed p,,, and estimated CO, flux variability and trends indicates 
that the model's representation of the carbon cycle is, on the whole, consistent with 
our current observational understanding. More observations are needed to better 
constrain internal variability and trends in the surface ocean carbon cycle. 
Forced trends in the CMIP5 ensemble. Twelve CMIP5 Earth system models are 
included in the analysis in addition to the CESM-LE for the historical period. 
The CMIP5 models included are those models that report CO) flux at monthly 
timescales for a historical simulation through 2005 and with the RCP8.5 scenario 
for 2006-2100; see Extended Data Table 3 for included models. The CESM1- 
Biogeochemistry (CESM1-BGC) model included in the CMIP5 model suite is a 
predecessor to the CESM-LE. 

Owing to the combined effect of a smaller number of ensemble members for 
CMIP5 and the larger variability across these ensembles (Extended Data Fig. 2d-f), 
due in part to structural differences”’, the forced trend in CO; flux cannot be 
identified from CMIP5 across most of the global oceans, even for the time frame 
1990-2089 (Extended Data Fig. 2a—c). Where the forced trend from CMIP5 is 
discernible, primarily in the equatorial Pacific and Southern Ocean, it is of the 
same sign as CESM-LE (increasing uptake) but of weaker magnitude. 

Data sources. Surface Ocean CO; Atlas eae y2)8 Poor data were taken from 
www.socat.info/access.html. CO; flux estimates”* were obtained from http://cdiac. 
esd.ornl.gov/oceans/SPCO2_1982_2011_ETH_SOM_FEN.html. 

Model output sources. CESM-LE”’ is available from https://www.earthsystemgrid. 
org/dataset/ucar.cgd.ccsm4.CESM_CAM5_BGC_LE.html. CMIP5°* is available 
from http://www.ipcc-data.org/sim/gcm_monthly/AR5/WG1-Archive.html. 
Code availability. Code for analysis and production of figures is available upon 
request. Please contact G.A.M. (gamckinley@wisc.edu). 
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Extended Data Figure 1 | Comparison of 1982-2011 mean CO, flux. 
a, Data-based climatology (ref. 28). b, CESM large ensemble 32-member 
mean. c, Mean of 12 CMIP5 models. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


1990-1999 1990-2019 1990-2089 


CMIP5 mean 
CO, flux trend 
CO, Flux Trend (molC/m?yr?) 


0.05 

0.025 z 
me} 

2% z 
=v £ 
=e oF 
oO 8 5 
rm 
a fell 
0.025 8 

-0.05 


Extended Data Figure 2 | Forced trends and variability of CMIP5 trends — CO, flux trend standard deviations, indicating the impact of variability 
in sea-to-air CO, flux. Forced trends for a, 1990-1999, b, 1990-2019 on CO; flux trends, for d, 1990-1999, e, 1990-2019 and f, 1990-2089. 
and c, 1990-2089. Grey areas are where the forced trend cannot be Negative values indicate increasing ocean carbon uptake. 
distinguished from the variability with 95% confidence (Methods). 
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Extended Data Table 1 | Comparison of observed and modelled peo, and COz flux variability for 1982-2011 


pco- CO; flux 
CESM-LE CESM-LE 
Data mean Data-based mean 
Biome variability variabiilty Uncertainty variability variability Uncertainty 
atm atm tm molC/m?/yr molC/m?/yr molC/m?/yr 

NP SPSS 7.9 10.2 2.0 0.42 0.13 0.02 
NP STSS 7.3 5.9 0.4 0.09 0.13 0.01 
NP STPS 4.2 5.4 0.5 0.05 0.07 0.01 
East EQ Pac 6.3 5.9 0.7 0.07 0.15 0.02 
West EQ Pac 13.0 12.6 2.0 0.23 0.36 0.05 
SP STPS 8.3 6.4 1.0 0.07 0.08 0.01 
NA SPSS 22.4 19.7 1.0 0.25 0.12 0.01 
NA STSS 7.4 5.2 0.6 0.10 0.10 0.02 
NA STPS 5.2 5.6 0.4 0.09 0.06 0.01 
EQ Atl 8.8 6.5 0.6 0.07 0.06 0.01 

SA STPS 9.7 8.5 0.5 0.10 0.06 0.007 
IND STPS 1141 10.6 1.0 0.04 0.10 0.01 
SO STSS 5.9 12.4 2.0 0.14 0.08 0.01 
SO SPSS 4.8 12.8 2.0 0.22 0.11 0.01 
SO ICE 15.4 17.0 2.0 0.22 0.05 0.005 


Variability is the standard deviation of the annual means from 1982 to 2011. Uncertainty of the variability is the standard deviation of the variability estimates for each of the 32 CESM-LE ensemble 
members. Underlining indicates that the modelled variability is lower than the observed variability, and italics indicates that the modelled variability is higher than the observed variability, in both cases 
taking into account the model-estimated uncertainty. Pco2 data are from ref. 33; COz flux data are from ref. 28. 
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Extended Data Table 2 | Comparison of observed and modelled Pco> and CO> flux trends for 1982-2011 


pco, CO, flux 

Data CESM-LE Data-based CESM-LE 

trend forced trend trend forced trend 
Biome (uatm/yr) (uatm/yr) (molC/m*/yr’) (molC/m*/yr’) 
NP SPSS 1.18 + 0.42 1.59 + 0.41 -0.012 + 0.017 -0.004 + 0.002 
NP STSS 1.99 + 0.32 2.16 + 0.24 -0.011 + 0.004 -0.008 + 0.002 
NP STPS 1.76 + 0.19 2.01 + 0.24 -0.008 + 0.002 -0.004 + 0.001 
West EQ Pac 1.21 + 0.28 1.67 + 0.15 -0.001 + 0.003 -0.005 + 0.002 
East EQ Pac 1.43 + 0.60 1.74 + 0.37 -0.023 + 0.010 -0.005 + 0.004 
SP STPS 1.17 +0.41 1.54 + 0.26 -0.006 + 0.003 -0.003 + 0.001 
NA SPSS 1.06 + 1.33 1.44+1.11 -0.014 + 0.11 -0.016 + 0.002 
NA STSS 1.73 + 0.51 1.77 + 0.32 -0.014 + 0.004 -0.014 + 0.001 
NA STPS 1.57 + 0.27 1.88 + 0.28 -0.006 + 0.004 -0.003 + 0.001 
EQ Atl 1.48 + 0.45 1.50 + 0.30 -0.001 + 0.003 -0.005 + 0.001 
SA STPS 0.92 + 0.53 1.49 + 0.44 -0.004 + 0.005 -0.005 + 0.001 
IND STPS 2.35 + 0.73 2.55 + 0.65 -0.008 + 0.002 -0.003 + 0.001 
SO STSS 2.00 + 0.29 1.57 + 0.59 -0.005 + 0.006 -0.007 + 0.001 
SO SPSS 1.58 + 0.24 0.97 + 0.55 0.004 + 0.010 -0.010 + 0.001 
SO ICE 1.42 + 0.84 0.51 + 0.85 -0.010 + 0.010 -0.016 + 0.001 


Underlining indicates that the modelled trend is lower than the observed trend, and italics indicate that the modelled trend is higher than the observed trend, given the uncertainty (95% trend 
confidence intervals). Trends are based on annual means. Pcos data are from ref. 33; COz flux data are from ref. 28. 
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Extended Data Table 3 | CMIP5 models used in this work 


Modeling Group Model Name Citation 
Beijing Climate Center (BCC), 
China Meteorological Administration, China 


BCC-CSM1.1m 49 


Beijing Normal University (BNU), China 
College of Global Change and Earth System BNU-ESM 44 
Science, China 


Canadian Centre for Climate Modeling and Analysis, 37 
. CanESM2 
Victoria BC, Canada 
National Center for Atmospheric Research, USA CESM1-BGC 45 
Centro Euro-Mediterraneo sui Cambiamenti Climatici, 
CMCC-ESM 4 

Lecce, Italy 
NOAA Geophysical Fluid Dynamics Laboratory, USA = GFDL-ESM2M 40 
Met Office Hadley Centre, UK HadGEM2 38 
Institut Pierre-Simon Laplace, 

: . IPSL-CM5-MR 39 
IPSL Climate Modelling Centre, France 
Institute for Numerical Mathematics, Russia INM-CM4. 47 


Japan Agency for Marine-Earth Science and 
Technology, Atmosphere and Ocean Research MIROC-ESM 48 
Institute, The University of Tokyo, Japan 


Max-Planck-Institute for Meteorology, Germany MPI-ESM-LR 42, 43 
Norwegian Climate Centre, Norway NorESM1-ME 46 


See refs 37-49. 
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Extended Data Table 4 | Biome long names and mean time of emergence 


Biome acronym Biome long name Mean Time of 
Emergence 

NP SPSS North Pacific subpolar seasonally stratified 2063 
NP STSS North Pacific subtropical seasonally stratified 2038 
NP STPS North Pacific subtropical permanently stratified biome 2069 
West EQ Pac West Pacific equatorial 2048 
East EQ Pac East Pacific equatorial 2031 
SP STPS South Pacific subtropical permanently stratified biome 2054 
NA SPSS North Atlantic subpolar seasonally stratified 2054 
NA STSS North Atlantic subtropical seasonally stratified 2030 
NA STPS North Atlantic subtropical permanently stratified biome 2051 
EQ Atl Atlantic equatorial 2018 

SA STPS South Atlantic subtropical permanently stratified biome 2027 
IND STPS Indian subtropical permanently stratified biome 2048 
SO STSS Southern Ocean subtropical seasonally stratified 2032 
SO SPSS Southern Ocean subpolar seasonally stratified 2029 
SO ICE Southern Ocean marginal sea ice 2018 


Names are taken from ref. 34. 
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Third-party punishment as a costly signal of 


trustworthiness 


Jillian J. Jordan!, Moshe Hoffman2, Paul Bloom! & David G. Rand!3-4 


Third-party punishment (TPP)!”’, in which unaffected observers 
punish selfishness, promotes cooperation by deterring defection. 
But why should individuals choose to bear the costs of punishing? 
We present a game theoretic model of TPP as a costly signal®!° 
of trustworthiness. Our model is based on individual differences 
in the costs and/or benefits of being trustworthy. We argue that 
individuals for whom trustworthiness is payoff-maximizing will 
find TPP to be less net costly (for example, because mechanisms"! 
that incentivize some individuals to be trustworthy also create 
benefits for deterring selfishness via TPP). We show that because 
of this relationship, it can be advantageous for individuals to 
punish selfishness in order to signal that they are not selfish 
themselves. We then empirically validate our model using 
economic game experiments. We show that TPP is indeed a signal 
of trustworthiness: third-party punishers are trusted more, and 
actually behave in a more trustworthy way, than non-punishers. 
Furthermore, as predicted by our model, introducing a more 
informative signal—the opportunity to help directly—attenuates 
these signalling effects. When potential punishers have the chance 
to help, they are less likely to punish, and punishment is perceived 
as, and actually is, a weaker signal of trustworthiness. Costly 
helping, in contrast, is a strong and highly used signal even when 
TPP is also possible. Together, our model and experiments provide 
a formal reputational account of TPP, and demonstrate how the 
costs of punishing may be recouped by the long-run benefits of 
signalling one’s trustworthiness. 

Costly third-party punishment (TPP) is widely observed in labora- 
tory! *” and field**° experiments (although see ref. 12), and appears to 
be universal across cultures!?. While collectively beneficial, TPP poses 
a puzzle: why should individuals incur the costs of punishment? 

We propose an answer based on reputation*!*"!®. Specifically, we 
introduce a game theoretic model of TPP as a costly signal of trustwor- 
thiness: if you see me punish selfishness, it can signal that I will not be 
selfish to you. Our model involves a partner-choice’? game with two 
roles. In each interaction, the ‘Signaller’ decides whether to send one 
or more costly signals; then the ‘Chooser’ decides whether to partner 
with the Signaller. 

As with all costly signalling models” "”, our model is based on indi- 
vidual differences: two ‘types’ of Signallers differ in their quality as 
interaction partners. For trustworthy types, it is payoff-maximizing to 
cooperate when trusted; for exploitative types, it is payoff-maximizing 
to defect. Choosers thus benefit from partnering with trustworthy 
Signallers, but are harmed by partnering with exploitative Signallers. 

Signallers’ types are fixed, but not directly observable. Therefore, 
Choosers must base their partner choice on the aforementioned costly 
signals. In each interaction, the Signaller’s cost of signalling is either 
small (less than the benefit of being chosen as a partner) or large 
(greater than the benefit of being chosen). It is thus beneficial to signal 
(in order to be chosen) when the cost is small, but not large. The key 
premise of costly signalling is that high-quality types are more likely 
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to experience small signalling costs than low-quality types (and are 
thus more likely to signal). Therefore, signals convey information about 
Signallers’ types, and Choosers benefit from preferring partners who 
signal. 

How does this relate to TPP and trustworthiness? We argue that TPP 
will typically be less net costly for trustworthy types (that is, individuals 
who find it payoff-maximizing to cooperate when trusted). Because 
TPP deters future harm against others, punishing may benefit the 
punisher (for example, via direct reciprocity from the victim of the 
punished transgression, or rewards from institutions or leaders seeking 
to promote cooperation); and these benefits should be larger for trust- 
worthy types, because the same mechanisms!! that make trustworthy 
behaviour advantageous also increase the benefits of preventing harm 
against others. (This argument implies that the costly signalling mech- 
anism we propose may interact positively with other mechanisms for 
TPP that are based on deterrence benefits.) Furthermore, because trust- 
worthy types are more desirable to interact with than exploitative types, 
they typically attract more partners—which may reduce TPP costs by 
offering protection against retaliation and facilitating coordinated pun- 
ishment”°. See Supplementary Information sections 1.2.3 and 1.3 and 
Extended Data Fig. 1 for formal models of these two microfoundations 
for our central argument. 

When TPP is less net costly for trustworthy types, it can serve as a 
costly signal of trustworthiness. Agents should thus sometimes punish 
for the express purpose of signalling their trustworthiness to Choosers 
(like a peacock’s tail signals genetic quality)—specifically, when the 
deterrence benefits of TPP are too small to outweigh the costs on their 
own (otherwise, TPP would occur without signalling), but the reputa- 
tional benefit of appearing trustworthy makes TPP worthwhile. 

Although TPP can convey information about type, there are often 
several possible ways to signal trustworthiness, and TPP is not always 
the most informative. Therefore, a crucial prediction of this signalling 
account is that when a more informative signal is available, the signal- 
ling value of TPP should be attenuated and less TPP should occur. To 
illustrate this fact, our model also includes the possibility of signalling 
via costly helping of third parties: because being trustworthy and help- 
ing both involve paying costs to benefit others, helping should typically 
bea very informative signal of trustworthiness (see Supplementary 
Information section 1.2.3). 

Agents in our model make decisions across three different scenarios 
in which Signallers have the opportunity to engage in (1) TPP, (2) third- 
party helping, or (3) both. In each scenario, Choosers know which 
signals were available to Signallers. An agent’s strategy specifies her 
actions as both the Signaller and Chooser in each scenario. 

Our equilibrium analysis identifies Nash equilibria that are robust 
against indirect invasion (RAII)*! (and thus likely to be favoured by 
natural selection; see Supplementary Information section 2.1). We also 
directly test which strategies are favoured by selection using stochastic 
evolutionary dynamics where agents interact at random to earn payoffs, 
strategies with higher payoffs become more common, and mutation 
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Figure 1 | A model of TPP as a costly signal of trustworthiness. 

Shown are the results of equilibrium calculations (in which we identify 
Nash equilibria that are RAII) and evolutionary dynamics, where Isp 
(Is) is the informativeness of small punishing (helping) costs (that is, 
the ratio between the probabilities that trustworthy versus exploitative 
Signallers experience small punishment (helping) costs), I;p (Izu) is the 
informativeness of large punishing (helping) costs (the inverse ratio, 
using large costs instead of small), b is the expected benefit of partner 
choice (product of the probability a Signaller is trustworthy and the gain 
to Choosers of accepting trustworthy Signallers), and c is the expected 
cost of partner choice (the same product, but for exploitative Signallers). 
a, Equilibria when only punishment is possible. ‘Punishment-signalling, 
in which Signallers punish when their punishing costs are small and 
Choosers only accept punishers, is an equilibrium in red and pink, when 
(i) large punishment costs are informative enough that the expected payoff 
of accepting a Signaller with a large cost is negative (I;p > b/c); and (ii) 
small punishment costs are informative enough that the expected payoff 
of accepting a Signaller with a small cost is positive (Isp > c/b). It is the 
unique equilibrium in red when also (iii) the expected payoff of accepting 
a random Signaller is negative (c/b > 1). We also see pooling equilibria 

in which Signallers never punish and Choosers always accept (in white 
and pink) or reject (in grey). For visualization, we fix the probability that 
an exploitative Signaller has a small punishment cost at 0.1, and vary 

the probability that a trustworthy Signaller has a small punishment cost 
(showing the resulting Isp). b, Equilibria when both signals are possible. 
We consider parameters where punishment-signalling and helping- 
signalling are the unique equilibria in scenarios 1 and 2, respectively 
(fixing c/b = 2, and showing Isp and Is; values >2). In scenario 3, an 
only-helping strategy profile, in which Signallers help when their helping 


maintains variation (see Supplementary Information section 3.1). 
This process can describe genetic evolution, as well as social learning 
whereby people imitate successful others. 

We first consider scenario 1, where Signallers have the opportunity 
to punish but not help. Here a punishment-signalling strategy profile 
(in which Signallers punish when experiencing small signalling costs, 
and Choosers only accept Signallers who punish) can be an equilibrium 
when punishment is sufficiently informative: that is, when trustworthy 
types are sufficiently more likely to receive small punishment costs, and 
less likely to receive large punishment costs, than exploitative types (see 
Fig. 1a for precise conditions). Thus, we confirm that TPP can signal 
trustworthiness when it is the only available signal. By symmetry, the 
same is true for helping when it is the only available signal (scenario 2). 
See Supplementary Information section 2.2 for details. 

What, then, happens in scenario 3 when TPP and helping are both 
possible? If helping is more informative, TPP may be ignored. To see 
why, consider a Signaller who punishes but does not help. If she did 
not have the opportunity to help, her choice to punish conveys positive 
information, and a Chooser might accept her. However, if helping was 
possible, her choice not to help conveys negative information—and 
when not helping is informative enough to outweigh the positive effect 
of punishing, the same Chooser might reject her. 
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costs are small but never punish, and Choosers demand helping but ignore 
punishment, is an equilibrium in blue and light purple, when (i) large 
helping costs are sufficiently more informative than small punishment 
costs that the expected payoff of accepting any Signaller with a large 
helping cost is negative, even if she also has a small punishing cost 

(It / Isp > b/c). It is the unique equilibrium in blue, when also (ii) small 
helping costs are sufficiently more informative than large punishment 
costs that the expected payoff of accepting a Signaller with a small 

helping cost remains positive even if she also has a large punishing cost 
(Isu / Ip > c/b). When helping and punishment are similarly informative 
(around the diagonal), we also see equilibria in which Choosers treat them 
equally, demanding helping or punishment (dark purple), or helping and 
punishment (light purple). For visualization, we vary the informativeness 
of both punishment and helping as in a, showing the resulting Isp and 

Isp; values. c, Evolutionary dynamics when punishment is moderately 
informative and helping is increasingly more informative. Here we use 
agent-based simulations and fix Isp = 4, I;p= 1.5 and c/b = 2, such that 
when punishment is the only signal (red triangles), it is favoured by 
evolution (regardless of how informative helping is, because helping is not 
possible in that scenario). We then vary the informativeness of helping as 
in b. We see that when both signals are available, evolution increasingly 
disfavours punishment as helping becomes increasingly informative (red 
circles). In contrast, Signallers help at high rates regardless of whether 
helping is the only signal (blue triangles) or both signals are available 
(blue circles). We plot Signallers’ probabilities of sending each signal when 
the cost is low; for scenario 3, we plot the average of the probabilities of 
punishing (helping) when (i) only the punishing (helping) cost is low, and 
(ii) both costs are low. See Extended Data Fig. 2 for Chooser strategies and 
disaggregated Signaller strategies. 


To formalize this argument, we vary the informativeness of TPP 
and helping in scenario 3. We focus on the parameter region where 
both TPP and helping are informative enough to serve as signals on 
their own (that is, punishment-signalling and helping-signalling are 
the unique equilibria in scenarios 1 and 2, respectively; everywhere in 
Fig. 1b). We find that when the informativeness of the two signals is 
sufficiently similar, there are equilibria in which Signallers are equally 
likely to engage in TPP and helping, and Choosers equally demand 
TPP and helping. However, as the informativeness of helping increases, 
and/or the informativeness of TPP decreases, the unique equilibrium 
becomes an only-helping strategy profile in which helping is signalled 
and demanded, and TPP is ignored. Specifically, only-helping becomes 
the unique equilibrium when Choosers receive (i) a positive expected 
payoff from accepting any Signaller with a small helping cost (even if 
she has a large punishing cost), and (ii) a negative expected payoff from 
accepting any Signaller with a large helping cost (even if she has a small 
punishing cost). See Fig. 1b for precise conditions. 

Critically, then, there are parameter regions in which it is an equi- 
librium to punish (and condition partner choice on punishment) in 
scenario 1 but not in scenario 3. Evolutionary dynamics show that as a 
result, TPP can evolve as a costly signal that is preferentially used when 
helping is not possible (Fig. 1c and Extended Data Fig. 2). 
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Our model thus makes clear predictions. First, when TPP is the only 
possible signal, it should be perceived as, and should actually be, an 
honest signal of trustworthiness. Second, when a more informative 
signal (for example, helping) is also available, third parties should be 
less likely to punish, and the perceived and actual signalling value of 
TPP should be attenuated. Third, the same should not be true of help- 
ing, which should continue to serve as a strong signal even when TPP 
is possible. 

We next test these predictions in a two-stage economic game con- 
ducted using Amazon Mechanical Turk in which TPP and helping 
signals can be sent, and then partner choice occurs (Extended Data 
Fig. 3 illustrates the experimental setup, and Supplementary 
Information section 4 discusses the link between our theoretical and 
experimental setups). As in our model, there are two roles in this game: 
Signaller and Chooser. 

In the first stage, the Signaller participants in a TPP game! (TPPG), 
interacting with people other than the Chooser. In the TPPG, a Helper 
decides whether to share money with a Recipient, and an unaffected 
Punisher decides whether to pay to punish the Helper if the Helper 
is selfish. To investigate the three scenarios from the model in which 
helping, punishment or both are available as signals, we manipulate 
whether the Signaller participates in the TPPG as the Helper, Punisher 
or both (playing twice with two different sets of other people). 

The second stage captures the psychology of partner choice using 
a trust game (TG). Here, both the Signaller and Chooser participate. 
The Chooser first decides how much of an endowment to send to the 
Signaller; any money sent is tripled. The Signaller then decides how 
much to return to the Chooser. The Chooser can condition her sending 
on the Signaller’s behaviour in the TPPG—and the Signaller knows this 
when deciding how to behave in the TPPG. 

Overall, therefore, our experiment is designed to include oppor- 
tunities to signal via TPP and/or helping, and to make helping more 
informative than TPP (see Supplementary Information section 5.1 for 
further discussion). 

The results confirm our theoretical predictions. First, in the 
punishment-only condition (where punishment is the only available 
signal, n = 397 Signaller-Chooser pairs), punishment is perceived by 
Choosers as a signal of trustworthiness: Choosers trust Signallers who 
punish in the TPPG more than those who do not (sending 16 percentage 
points more to punishers than non-punishers, P < 0.001, Fig. 2a). 
Furthermore, punishment actually is an honest signal of trustworthi- 
ness: Signallers who punish return significantly more in the TG than 
non-punishers (returning 8 percentage points more, P=0.001, Fig. 2b). 
P values generated using linear regression with robust standard errors; 
see Supplementary Information section 5.2. 

Second, in the punishment-plus-helping condition (where help- 
ing is also possible, n = 393 Signaller-Chooser pairs), Signallers use 
punishment less than in the punishment-only condition: only 30% 
of Signallers punish in punishment-plus-helping, compared to 41% 
in punishment-only (P=0.002, Fig. 2c). Furthermore, providing the 
option to help attenuates the perceived and actual signalling value of 
punishment: in the punishment-plus-helping condition, controlling 
for helping, Choosers only trust punishers slightly more than non- 
punishers (4 percentage points more sent to punishers than non- 
punishers, P= 0.004, Fig. 2a), and Signallers who punish in the TPPG 
do not return significantly more in the TG than non-punishers (0.3 
percentage points less returned by punishers than non-punishers, 
P=0.900, Fig. 2b). Thus the effects of punishment on trust and trust- 
worthiness are significantly smaller in punishment-plus-helping than 
punishment-only (interactions: P< 0.001 and P=0.016, respectively). 
See Supplementary Information section 5.2 for details. 

Third, in the helping-only condition (n = 409 Signaller-Chooser 
pairs), just as many Signallers help (81%) as in the punishment- 
plus-helping condition (82%) (P=0.650, Fig. 2c). Furthermore, 
in both conditions, Choosers preferentially trust Signallers who 
help (39 percentage points more sent to helpers in helping-only, 
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Figure 2 | Behavioural experiments confirm key predictions of our 
model. a, TPP is perceived as a signal of trustworthiness. In the TG, 
Choosers trust Signallers who punished in the TPPG more than non- 
punishers. This effect is much larger, however, when punishment is the 
only available signal (punishment-only) than when both punishment and 
helping are possible (punishment-plus-helping) (left column). In contrast, 
Choosers trust Signallers who helped in the TPPG much more than non- 
helpers, regardless of whether punishment is possible (right column). 

b, TPP actually is a signal of trustworthiness. Signallers who punished 

in the TPPG return more in the TG than non-punishers, but this 

effect is much larger when Signallers did not have the chance to help 
(punishment-only) than when they did (punishment-plus-helping) (left 
column). In contrast, Signallers who helped in the TPPG return more than 
non-helpers, regardless of whether they had the opportunity to punish 
(right column). c, TPP is used as a signal of trustworthiness. Signallers 
are more likely to punish when they do not have the opportunity to 

help (punishment-only) than when they do (punishment-plus-helping) 
(left column). In contrast, they are just as likely to help when they do 

not have the opportunity to punish (helping-only) as when they do 
(punishment-plus-helping) (right column). a and b show predicted mean 
trust and trustworthiness, respectively (in punishment-plus-helping, 
regression models include terms for both punishment and helping, but 
not their interaction, as there were no significant punishment-by-helping 
interactions); c shows predicted probability of sending the relevant signal 
from logistic regressions including condition. Error bars are + 1 s.e.m. 


P<0.001; 37 percentage points more sent in punishment-plus-helping 
(controlling for TPP), P< 0.001, Fig. 2a), and Signallers who 
help are more trustworthy (25 percentage points more returned by 
helpers in helping-only, P < 0.001; 22 percentage points more returned 
in punishment-plus-helping (controlling for TPP), P< 0.001, Fig. 2b). 
These differences between conditions are not significant (interactions: 
P=0.539 and P=0.623, respectively). Thus, while helping attenuates 
the signalling value of TPP, TPP does not attenuate the signalling value 
of helping. 

These results offer clear support for our model of TPP as a costly sig- 
nal of trustworthiness. We therefore provide evidence that people may 
punish to provide information about their character to observers, rather 
than just to harm defectors or deter selfishness. This theory helps to 
reconcile conflicting previous experimental results about whether TPP 
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confers reputational benefits. Our conclusion that the signalling value 
of TPP is mitigated when more informative signals of trustworthiness 
are also available explains why a large positive effect of punishment on 
trust was found in one experiment in which helping information was 
absent’, while little effect was found in another experiment in which 
helping was observable!®. This conclusion also provides an explanation 
for why TPP and trustworthiness were found to be uncorrelated in an 
experiment in which both punishment and helping were possible”’. 
Finally, our theory also explains why participants preferred punishers 
as partners to a greater extent in situations in which participants could 
benefit from choosing a prosocial partner’. 

Our results cannot be explained by the alternative theory that TPP 
is perceived as a signal of willingness to retaliate when harmed directly 
(although TPP may signal retaliation in other contexts), because retali- 
ation is not possible in the TG. Even if Choosers sent more to punishing 
Signallers out of an ‘irrational fear of retaliation, helping information 
should not attenuate this effect (as helping is unlikely to be a more 
informative signal of retaliation than TPP). Furthermore, an additional 
experiment (Extended Data Fig. 4 and Supplementary Information sec- 
tion 6) finds that TPP elicits larger reputational benefits when stage 2 
isa TG than an ultimatum game (where signalling retaliatoriness is 
advantageous). 

Importantly, punishers need not be consciously seeking to signal 
their trustworthiness; at a proximate level, TPP may be motivated 
by emotions like moral outrage’. Thus, TPP may be based on social 
heuristics”® rather than explicit reasoning, and is unlikely to be per- 
fectly sensitive to context—signalling motives may ‘spill over’*® to 
settings where TPP cannot function as a signal (for example, anon- 
ymous interactions, or settings in which engaging with trustwor- 
thy Signallers is not actually advantageous to Choosers, such as the 
Dictator Game?’). 

Relatedly, while our model assumes that different types of individuals 
have different costs of TPP and helping, and different optimal responses 
to being trusted, our experiments do not vary subjects’ payoffs of pun- 
ishing, helping and being trustworthy. Instead, the experiments tap into 
participants’ pre-existing inclinations to punish, help and reciprocate 
the trust of others, reflecting the incentives experienced in daily life”. 
Thus, because we do not exactly recreate the model in the laboratory, 
our results are consistent with the idea that the model operates outside 
of the laboratory (rather than merely showing that participants can 
reason strategically about a novel game). 

In sum, we help answer a fundamental question regarding human 
nature: why do humans care about selfish behaviours that do not affect 
them personally? Although TPP may often appear ‘altruistic, we show 
how punishing can be self-interested in the long-run because of repu- 
tational benefits. Sometimes punishing wrongdoers is the best way to 
show that you care. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Extended Data Figure 1 | Agent-based simulations from our second 
microfoundation model in which gaining interaction partners reduces 
TPP costs. TPP evolves over time in this modified model, in which a 
Signaller’s punishment costs are endogenous (decreasing in the number 
of times she has been accepted as a partner), rather than exogenously 
fixed as lower for trustworthy types. We use parameters similar to the 
main text agent-based simulations, where punishment is moderately 
informative and helping is more informative. Shown is the average over 


500 simulations of Signallers’ average probability of helping and punishing 
(when experiencing the small signalling cost) in each generation, as well 
as the expected probability of experiencing the small punishing cost for 
trustworthy and exploitative types (based on the average number of times 
trustworthy and exploitative types were chosen as partners) at the end 

of each generation. See Supplementary Information section 1.3.2 fora 
detailed description of our second microfoundation model. 
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a. Only punishment: Signaller strategies 


b. Only helping: Signaller strategies 


c. Both signals: Signaller strategies 
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Extended Data Figure 2 | Full agent-based simulation results from the 
main text model. Here, we present the Signaller and Chooser strategies for 
each scenario from our main model agent-based simulations, a summary 
of which is shown in Fig. 1c. In scenario 1, when only punishment is 
possible, punishment-signalling evolves, regardless of the informativeness 
of small helping costs Isq. a, Signallers are likely to punish when the 
punishment cost is small and b, Choosers are likely to accept Signallers 
who punish, while they almost always reject those who do not. In scenario 2, 
when only helping is possible, helping-signalling evolves, and becomes 
stronger as Is; increases. c, Signallers are increasingly likely to help 

when the helping cost is small and d, Choosers are increasingly likely to 
accept Signallers who help, while they almost always reject those who do 
not. In scenario 3, when both signals are available, agents evolve to use 
both signals with equal frequency when they are equally informative, 


Informativeness of small helping costs (I 


Informativeness of small helping costs (I 


su) sh) 


but to favour helping as Isr; increases. e, As Isy increases, Signallers are 
increasingly likely to help, both when they have only a small helping cost 
(light blue dots), and when they have both small costs (dark blue dots); 
and are decreasingly likely to pay to punish, both when they only have a 
small punishing cost (light red dots), and when they have both small costs 
(dark red dots). f, As Isr increases, Choosers are increasingly likely to 
accept Signallers who help but do not punish (blue dots), and increasingly 
likely to reject Signallers who punish but do not help (red dots). 
Furthermore, regardless of Is, Choosers almost always reject Signallers 
who neither help nor punish (brown dots). However, Chooser behaviour 
in response to Signallers who both punish and help (purple dots) stays 

at chance levels across all values of Is; (because Signallers never send 
both signals, and thus Choosers do not face selection pressure to respond 
optimally to such Signallers). 
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Extended Data Figure 3 | Our two-stage experimental design involving 
Signallers and Choosers. First, in the signalling stage, the Signaller 
participates in a third-party punishment game (TPPG). Here a Helper 
decides whether to share with a Recipient, and then a third-party Punisher 
decides whether to pay to punish the Helper if the Helper was selfish 
(chose not to share). In our three experimental conditions, we manipulate 
the role(s) the Signaller plays in the TPPG. In the punishment-only 
condition, the Signaller plays once as the Punisher; in the punishment- 
plus-helping condition, the Signaller plays twice (with two different sets 


Signaller 


of other people) as the Punisher and the Helper; in the helping-only 
condition, the Signaller plays once as the Helper. Thus we vary which 
signal(s) are available. Second, in the partner choice stage, the Chooser 
plays a trust game with the Signaller. The Chooser decides how much to 
send the Signaller and any amount sent is tripled by the experimenter. 

The Signaller then decides how much of the tripled amount to return. 
Choosers use the strategy method to condition their sending on Signallers’ 
TPPG decisions. 
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Extended Data Figure 4 | Third-party punishment is perceived as a 
stronger signal of trustworthiness than retaliation in our additional 
experiment (study 2). In our additional experiment, we manipulate 
whether the second stage of our game is a trust game (TG) or an 
ultimatum game (UG). In the TG, Choosers maximize their payoffs by 
sending more money to trustworthy Signallers (who will return a large 
amount); thus, preferential sending to punishers reflects expectations 
of punisher trustworthiness. In this game (left bars), punishment has 
large reputational benefits: replicating study 1, Choosers (n = 405) send 
16 percentage points more to punishers than non-punishers, P < 0.001. 
In the UG, Choosers (n = 421) maximize their payoffs by sending more 
money to retaliatory Signallers (who are willing to pay the cost required 


to reject low offers); thus, preferential sending to punishers reflects 
expectations of punisher retaliation. In this game (right bars), punishment 
has smaller reputational benefits: Choosers send 3 percentage points more 
to punishers than non-punishers, P= 0.001. This difference between 
conditions is significant (P < 0.001) and robust to accounting for the 

fact that there is less overall variance in UG offers than TG transfers 

(see Supplementary Information section 6). Thus TPP is perceived as a 
stronger signal of trustworthiness (in the TG) than willingness to retaliate 
(in UG). These findings provide further evidence that our TG experiment 
results (study 1) are not driven by a perception that TPP signals retaliation 
(although TPP may also signal retaliation in other contexts). Shown is 
mean sending in each game. Error bars are + 1 s.e.m. 
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A simple rule governs the evolution and 
development of hominin tooth size 


Alistair R. Evans!, E. Susanne Daly, Kierstin K. Catlett?*, Kathleen S. Paul*, Stephen J. King®, Matthew M. Skinner”*®, 
Hans P. Nesse*, Jean-Jacques Hublin®, Grant C. Townsend’, Gary T. Schwartz** & Jukka Jernvall'® 


The variation in molar tooth size in humans and our closest 
relatives (hominins) has strongly influenced our view of human 
evolution. The reduction in overall size and disproportionate 
decrease in third molar size have been noted for over a century, 
and have been attributed to reduced selection for large dentitions 
owing to changes in diet or the acquisition of cooking’”. The 
systematic pattern of size variation along the tooth row has been 
described as a ‘morphogenetic gradient’ in mammal, and more 
specifically hominin, teeth since Butler? and Dahlberg*. However, 
the underlying controls of tooth size have not been well understood, 
with hypotheses ranging from morphogenetic fields’ to the clone 
theory’. In this study we address the following question: are there 
rules that govern how hominin tooth size evolves? Here we propose 
that the inhibitory cascade, an activator-inhibitor mechanism that 
affects relative tooth size in mammals’, produces the default pattern 
of tooth sizes for all lower primary postcanine teeth (deciduous 
premolars and permanent molars) in hominins. This configuration 
is also equivalent to a morphogenetic gradient, finally pointing to 
a mechanism that can generate this gradient. The pattern of tooth 
size remains constant with absolute size in australopiths (including 
Ardipithecus, Australopithecus and Paranthropus). However, in 
species of Homo, including modern humans, there is a tight link 
between tooth proportions and absolute size such that a single 
developmental parameter can explain both the relative and absolute 
sizes of primary postcanine teeth. On the basis of the relationship 
of inhibitory cascade patterning with size, we can use the size at one 
tooth position to predict the sizes of the remaining four primary 
postcanine teeth in the row for hominins. Our study provides a 
development-based expectation to examine the evolution of the 
unique proportions of human teeth. 

Nearly 80 years ago, Butler*’ described the morphogenetic gradi- 
ent in mammalian postcanine teeth. From anterior to posterior, the 
deciduous premolars and molars increase in size, and in some species 
the posterior molars then decrease, with only one local maximum of 
tooth size along the row. Butler? interpreted this pattern to be gener- 
ated by a morphogenetic field, where the concentration of a diffusible 
morphogen determined size. The pattern appeared to apply both to 
deciduous premolars and to molars, which together are considered 
primary teeth’. Unlike molars, deciduous premolars are replaced with 
a secondary dentition, called the permanent premolars. While several 
authors have investigated the morphogenetic gradient in hominins*”, 
they have generally investigated permanent premolars rather than their 
deciduous predecessors. 

In 2007, a developmental mechanism controlling relative molar size 
in mice either by separating adjacent molars or by applying growth 
factors in the culture was experimentally discovered®. In the resulting 
‘inhibitory cascade’ model, molar activator/inhibitor ratio determines 


the size of subsequently developing molars. Whereas activation is prin- 
cipally considered to be mesenchymal, previously initiated molars are 
the source of inhibition, thereby causing a patterning cascade from 
anterior to posterior molars. The model appears to explain a high pro- 
portion of the variation in relative molar size in murines, primates and 
fossil mammaliaforms®!°-!°, Mice, however, lack all premolars, but 
the inhibitory cascade implies that a previously initiated tooth should 
always inhibit the subsequently developing tooth (for example, the 
fourth deciduous premolar, dp4, should inhibit the first molar, m1). 

Here, we test whether the inhibitory cascade explains the mor- 
phogenetic gradient in the primary postcanine tooth size of homin- 
ins and great apes. We partition the lower dentition into triplets: 
(1) the third and fourth deciduous premolars, dp3 and dp4, and the first 
molar, m1 (dp3—dp4—m1); (2) dp4—m1-m2; and (3) the three molars 
(m1-m2-m3). Ifa triplet follows the inhibitory cascade pattern, then 
the central tooth is the average size of the two outer teeth. This is math- 
ematically equivalent to the central tooth being one-third of the total 
triplet size, a manifestation of the inhibitory cascade® (Supplementary 
Information). As a result, the three teeth show a linear change in size 
with tooth position; hence, linearity of size change is a proxy for the 
inhibitory cascade. 

Our analysis of 58-66 modern human populations for lower 
molars and 8 populations for lower deciduous premolars shows a 
linear increase of the average sizes of the first triplet (dp3-dp4—m1; 
ordinary least squares (OLS) regression R?=0.9998; Fig. 1). The third 
triplet (molars) also follows the inhibitory cascade pattern, but here 
size decreases linearly from m1 to m3 (R?=0.974). On average, m1] is 
the largest tooth in the row, with size first increasing and then decreas- 
ing about this central tooth position. The second triplet dp4—m1-m2 
does not follow the linear pattern predicted by the inhibitory cascade 
because the middle tooth is the largest. We call this change in direction 
a reversal of the inhibitory cascade patterning. 

Fourteen species of fossil hominins (eight with data on both deciduous 
premolars) also follow the inhibitory cascade in the first triplet (Fig. 1 
and Extended Data Fig. 1). The close fit of the dp3-dp4—m1 triplet for 
hominins allows us to predict that the mean size of the undiscovered 
dp4 of Ardipithecus ramidus will be the average of the dp3 and m1 
sizes, that is, 73 mm? in area (star in Fig. 1a). In all extinct hominins 
the second or third molar is the largest tooth on average. In most aus- 
tralopiths (for example, Paranthropus boisei; Fig. 1) the second triplet 
(dp4—m1-m2) also follows the inhibitory cascade, as the m1 is the 
average of the two adjacent teeth, pushing the reversal position to m2 
or m3. This contrasts with a reversal position at m1 in Homo sapiens. 

Here we used a simple measure of tooth size, length by width rectan- 
gular area, because it is the most commonly used and, therefore, exten- 
sive data sets are available. To assess alternative measures of size we 
calculated three additional metrics from micro-computed tomography 
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Figure 1 | All hominins show the inhibitory cascade pattern for 
dp3-dp4-m1 triplet, but species of Homo show greater reduction in size 
of posterior molars. a, Area (mediodistal length x buccolingual width) 

of each lower postcanine primary tooth for 7 of the 15 hominin species in 
this study. The inhibitory cascade predicts a linear relationship of the sizes 
of three adjacent teeth, as seen for dp3-dp4-m1 triplet and dp4-m1-m2 
triplet for P boisei. Red dotted line shows expected linear relationship 

for dp3—dp4-m1 triplet for Ar. ramidus; red star shows predicted size of 
undiscovered dp4 (73 mm’). Mean +s.e.m. of populations for H. sapiens 
(dark blue), and of individuals for fossil hominin species. b, Measurements 
of tooth area used in this study illustrated on H. erectus Sangiran 1B: 
mesiodistal length x buccolingual width (the principal measure used in 
the analyses), 3D enamel-dentine junction area, 2D crown area and 2D 
cervix area. 
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scans using a subset of fossil hominin specimens: tooth occlusal outline 
area, enamel-dentine junction 3D surface area, and cervical cross- 
sectional area (Fig. 1b). All show the same general pattern of size rela- 
tionships (Extended Data Fig. 2). The first two of these were very highly 
correlated with rectangular area (R? > 0.94), cervical area only slightly 
less so (R* = 0.86; Extended Data Fig. 3). 

Expressing the relative size of each tooth in a row as a proportion of 
the largest tooth in the row reveals a close relationship between abso- 
lute m1 size and relative tooth size for Homo species (Extended Data 
Fig. 4). This contrasts with the remaining hominin taxa (that is, the 
australopiths), where the proportions are essentially constant with m1 
size. Great ape tooth proportions are intermediate to Homo species 
and australopiths. 

Ina 3D plot combining tooth position, the relative size of each tooth 
and the absolute size of m1, all data points generally fall on two dis- 
tinct planes in 3D space (meaning that relative size has linear relation- 
ships both with m1 size and with tooth position; Extended Data Fig. 5 
and Supplementary Videos 1 and 2). The tight fit of the teeth to both 
the anterior plane A and the posterior plane B reveals two important 
findings: (1) these species follow the expected inhibitory cascade pat- 
tern of linear change for the first three (or four) teeth in the primary 
tooth row; (2) there is a strong relationship between tooth position 
and proportional sizes in all hominins. In Homo the reversal position 
(intersection between planes A and B) changes with absolute m1 size, 
while in the australopiths the reversal position is constant with absolute 
ml size. Using these planar relationships, we can convert proportional 
size to absolute size based on a given m1 size (Extended Data Fig. 6; 
Supplementary Information). Standardizing m1 size to a straight line in 
3D space produces 3D surfaces that can represent primary postcanine 
tooth sizes for australopiths or Homo species (Fig. 2). This diagram 
essentially shows how absolute tooth sizes and proportions vary with 
changes in m1 size. Deviations of the species means from these surfaces 
are relatively minor (average and maximum deviation for Homo species 
are 7.1% and 18.0% respectively; for australopiths, 7.6% and 24.5%; 
Extended Data Fig. 7). 

For a given tooth position and size, Supplementary Spreadsheet 1 
predicts the sizes of the remaining positions in the primary postcanine 
row. These predictions are based on species means and interspecific 
scaling relationships, and so represent sizes and proportions typical of 
species for that m1 size. Figure 3 illustrates the 3D prediction surfaces 
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Figure 2 | Prediction surfaces for hominin tooth sizes based on 
inhibitory cascade and scaling of inhibitory cascade reversal with m1 
size. Tooth area (vertical axis) for each tooth position (dp3-m3) and 

area of m1. Species mean tooth areas (spheres) and prediction surface for 
Homo species are plotted in blue, and australopiths in red. Vertical lines 
connecting spheres to surface show deviation of the species means from 
predicted size. Areas are in square millimetres. See Supplementary Video 5 
for 3D rotating graph animation. 


as contour plots for Homo species and australopiths, showing the sizes 
at each tooth position for a given m1 size in that row (see also Extended 
Data Fig. 8). 

Given the close fit of almost all australopiths to their prediction sur- 
face (Fig. 2) and a very similar pattern in the great apes, we postulate 
that a tight association between tooth proportions and m1 size existed 
at the base of the hominin clade (Extended Data Figs 1, 4 and 5). Near 
the origin of the genus Homo, a change in the scaling relationship 
between m1 size and inhibitory cascade patterning occurs, such that 
the reversal position changes with absolute m1 size and shifts mesially 
(Fig. 4). Interestingly, Homo habilis shows proportions more similar to 
australopiths than its congeners, which suggests this shift occurred after 
the origin of H. habilis, and agreeing with H. habilis perhaps belonging 
to the genus Australopithecus'®. The Dmanisi specimens demonstrate 
the heterogeneity present in the early Homo pattern: one specimen 
(D2600) resembles the australopith pattern whereas the others are 
closer to the Homo pattern (Supplementary Information). 

The smallest-bodied hominin, Homo floresiensis, is most similar to 
the smallest-bodied great ape, Pan paniscus, in tooth sizes and propor- 
tions and is smaller than tooth rows of human populations with the 
largest tooth at m2 (Extended Data Fig. 1). Among fossil hominins, 
H. floresiensis exhibits proportions similar to Homo heidelbergensis 
and Homo erectus, but ~40% smaller in absolute area. The recently 
described Australopithecus deyiremeda"’ and Australopithecus sediba 
both follow the australopith pattern. 

H. sapiens is similar to its congeners but at smaller m1 sizes, produc- 
ing disproportionately reduced third molars. For most modern humans 
m1 is the largest tooth, but a small proportion of humans have m2 as 
the largest tooth (11-19% of individuals in some populations'’, and 
8.6% of the population means in our sample). 

We note that specimen-level predictions can deviate from the 
species-level scaling patterns. When the size of each tooth is used to 
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Figure 3 | Hominin prediction plot for primary postcanine rows. 

a, c, Contour plot of prediction surfaces in Fig. 2. Contours show tooth 
areas in square millimetres. Red contours are for australopiths (a), blue 
for Homo species (c). To predict the mean sizes of remaining four teeth 
in a row from the mean size of one tooth position (for example, an 
australopith m1 of 250 mm’), start at the known tooth’s position (x axis), 
move vertically (orange arrow) to the red contour of the measured size, 


predict the other teeth in the same specimen, the average error is 10.3% 
and 7.9% for Homo and australopiths, respectively (Supplementary 
Information). The largest prediction errors (above 30%) are found 
in H. erectus (D211, Thomas I), H. heidelbergensis (Arago 1) and 
H. neanderthalensis (KMH1, Krapina 1,7,79 and Krapina 64). Some 
of the discrepancies are probably related to errors in size estimation 
owing to developmental age, preservation and wear, and potential 
identification inaccuracies in making composite specimens. Increased 
intraspecific variation in Homo species could also result from relaxed 
selection, implicated by the increased fluctuating asymmetry found in 
most Homo dentitions compared with australopiths and great apes!®”°, 
or multiple selection pressures. Another possibility is developmental 
instability when molars in a tooth row approach equal size: for example, 
extra molar presence in mouse experiments seems linked to the molars 
being of equal size®. 


then move horizontally (green arrows) to the other tooth positions (green 
crosses) and read off sizes from the red contours. Asterisks and dashed 
lines indicate position of inhibitory cascade reversal. b, Predicted tooth 
row sizes for mean m1 areas of 175 and 250 mm’ for australopiths and 
125 mm? for Homo, which are similar in size to the species listed below 
each tooth row. Lines above tooth rows indicate triplets that follow the 
inhibitory cascade (deviation from inhibitory cascade less than 5%). 


Developmentally, in the activator—-inhibitor model, higher or 
lower relative inhibition yields smaller or larger posterior molars, 
respectively. Absolute tooth size was independent of the inhibitory 
cascade in the mouse-derived model (ref. 6 and Extended Data 
Fig. 9), whereas size and proportions are linked in hominins 
(Extended Data Fig. 4). The simplest way to incorporate size into the 
inhibitory cascade is through the activation, because the first tooth 
in a developmental series is unimpeded by the inhibition by other 
teeth. Therefore, we hypothesize a decrease in mesenchymal activa- 
tion drives the change in tooth proportions in Homo. The mechanism 
controlling reversal position, however, remains to be determined. 
While the inhibitory cascade has been invoked to account for pat- 
terns in vertebrate limbs, digits and somites”!, a link between abso- 
lute size and patterning in multiple vertebrate systems remains to be 
established. 
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Figure 4 | Phylogenetic distribution of tooth sizes and proportions in 
hominins shows an origin of the Homo pattern shortly after the origin 
of the genus. Tooth sizes plotted to scale for all species in the current 
study. Supplementary Spreadsheet 1 was used to predict measurements 
for unavailable tooth positions. The largest tooth for the species mean 

is filled black, as is any other tooth that is within 5% of the size of the 


largest tooth. The largest tooth/teeth for fossil individuals with all three 
molars preserved are indicated with a white circle. The phylogeny is 
modified from ref. 28 to include only taxa represented in this study, with 
the addition of P paniscus, Pongo pygmaeus and Au. deyiremeda (***) 
(Supplementary Information). 


25 FEBRUARY 2016 | VOL 530 | NATURE | 479 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


In conclusion, previous work comparing relative molar sizes using 
step indices”* and ratios”? identified significant changes in these pro- 
portions throughout hominin evolution; here we explain such changes 
based on the developmental inhibitory cascade mechanism. Whereas 
selective pressures emphasizing function, such as changing bite force, 
have been used to explain the variation in tooth proportions”*’, only 
by including development can one explain the details of the changes. 
By providing a development-based expectation for the evolution of 
the hominin dentition, the inhibitory cascade framework moves this 
research towards a predictive science, further testable with additional 
fossils. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Tooth size data. Human lower primary postcanine tooth sizes, defined as rectan- 
gular areas of mesiodistal length x maximum buccolingual width, were collated 
from the literature, and measured from dental casts at the Burlington Growth 
Centre, Toronto, Canada. All research protocols were reviewed and granted 
exemption by Arizona State University’s Institutional Review Board and the 
Burlington Growth Centre, and informed consent was obtained for all human 
subjects. Population-level means of molar sizes were collated from studies where 
all three molars were measured and wear was not excessive, largely from compi- 
lations**”? (see Supplementary Information). No statistical methods were used 
to predetermine sample size. Fossil hominin tooth size measurements were com- 
piled at the specimen level and assigned to taxonomic groups (see Supplementary 
Information), using measurements adjusted for wear when available. H. erectus 
specimens from Asia and Africa were grouped separately. Ape tooth size measure- 
ments were obtained from the literature*” or measured from museum specimens 
(see Supplementary Information). For all individual-level data, measurements 
taken from both sides of the same individual were averaged. Individuals of the 
same sex were averaged, then sexes were averaged to obtain a population or species 
mean size for each tooth. 

Analyses. For each primary tooth row (mean of sex, population or species), all 
teeth were scaled so that the largest tooth in the row equalled 1, called Prop Max 


LETTER 


in Row. For hominins and great apes separately, we used R version 3.1.3 (ref. 31) to 
perform multiple linear regression for Prop Max in Row ~ Tooth Position + Area 
of m1, and linear regression for Max Area in Row ~ Area of m1. For hominins, 
Prop Max in Row was converted to Area by multiplying it by Max Area in Row 
for the given m1 size. This function was then divided by the ratio of the expected 
Area of m1 to that calculated for m1 to give m1 size a 1:1 relationship with Area 
in the predictive surface (see Supplementary Information and Extended Data 
Fig. 10 for calculations and plots). Individual-level fossil data were used for the pre- 
diction error calculation using the relevant Homo or australopith prediction sur- 
face. Prediction error was calculated as 100 x |(observed — predicted) |/observed. 
Three-dimensional tooth size data. X-ray microtomographic scans of six hominin 
specimens with three sufficiently unworn lower molars were performed at the Max 
Planck Institute for Evolutionary Anthropology, Leipzig, Germany. Three-dimensional 
tooth models were measured for rectangular area, tooth occlusal outline area, enamel- 
dentine junction 3D surface area, and cervical cross-sectional area (Fig. 1b). 
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Extended Data Figure 1 | Homo species and australopiths differ in 

their pattern of tooth sizes, but all hominins and great apes follow 

the inhibitory cascade for dp3-dp4-m1 triplet. The inhibitory cascade 
predicts that there is a linear relationship among three adjacent teeth. Area 
(in square millimetres) of each lower postcanine primary tooth. a, Mean 
area of each tooth for 15 hominin species. b—e, Red points and lines are 
species means. b, H. sapiens; black points and lines represent means of 
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populations. c, Eight australopith species and (d) six fossil Homo species; 
black points and lines represent individual tooth rows (left and right rows 
of each specimen plotted separately). e, Four great ape species; black points 
and lines represent means of each sex. f, Two great ape species; black 
points and lines represent individuals, red points and lines are means for 
each sex. Sex and species means show clearer inhibitory cascade patterns 
than most individuals. 
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Extended Data Figure 2 | Two- and three-dimensional measures of tooth size for six fossil hominin specimens. Rectangular area (mesiodistal 
length x buccolingual width, MDBLArea), 3D area of the enamel-dentine junction (EDJ3DArea), cross-sectional area of the tooth at the cervix 
(CervixArea) and outline area of the outer enamel surface (OES2DArea) for each tooth position. 
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Extended Data Figure 4 | The proportional size of each tooth shows row) versus area of m1 (in square millimetres) for 15 hominin and 4 great 
a tight relationship with absolute size of the first molar, with the ape species. Blue triangles and solid line, OLS regression for Homo species; 
relationship differing between Homo species, australopiths and great red circles and dashed line, OLS for australopiths; yellow squares and 


apes. Proportional size of each tooth (proportion of the largest tooth in the dotted line, OLS for great apes. 
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Extended Data Figure 5 | See next page for figure caption. 
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Extended Data Figure 5 | Tooth proportions of hominins are 
constrained by the inhibitory cascade and size of m1. Three- 
dimensional space of tooth position (horizontal axis, numbered 1-5 for 
dp3-m3), area of m1 (axis into page) and proportion of maximum in 
tooth row (vertical axis). The proportional sizes of all teeth lie on two 
planes in 3D space. For all groups, plane A is fitted to dp3-dp4—m1, and 
plane B to m2-m3. a, Homo species, plane A (cyan; R? = 0.96) formula: 
HomoA PropMaxinRow = 0.238 x ToothPos — 0.00166 x AreaM1 + 0.441. 
Plane B (blue; R* = 0.62) formula: HomoBPropMaxinRow = —0.0822 x 
ToothPos + 0.000690 x AreaM1 + 1.23. Thick blue line shows intersection 
of planes. b, Australopiths, plane A (light red; R?=0.93) formula: 


AustAPropMaxinRow = 0.0810 x ToothPos + 0.230 x AreaM1 + 2.38 x 10~°. 


Plane B (dark red; R* = 0.07) formula: AustBPropMaxinRow = 0.00963 x 
ToothPos + 0.000168 x AreaM1 + 0.906. Thick red line shows intersection 
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of planes. c, Great apes, plane A (yellow; R*=0.98) formula: 
ApeAPropMaxinRow = 0.268 x ToothPos — 0.0727 x AreaM1 + 0.173. 
plane B (light brown; R? = 0.63) formula: ApeBPropMaxinRow = 
—0.0837 x ToothPos + 0.000337 x AreaM1 + 1.29. While the R? values 
are substantially lower for the plane B regressions, the average deviations 
from plane B for Homo and australopiths are 0.026 and 0.022 respectively, 
which are lower than the equivalent values of 0.046 and 0.036 for plane A. 
Therefore, the low R? values do not reflect the close fit of the data to the 
planes. d, Comparison of Homo, australopith and great ape planes shows 
that the corresponding planes and intersections for the first two groups 
diverge at smaller m1 sizes. The great ape planes fall between those of the 
other two groups. See Supplementary Videos 1-4 for 3D rotating graph 
animations. 
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Extended Data Figure 6 | The size of the largest tooth in the row is closely related to the size of the m1 in hominins. OLS regressions. 


HomoMaxArealInRow = 1.312 x AreaM1 — 30.44, P= 0.001, R? = 0.90; AustMaxAreaInRow = 1.298 x AreaM1 + 0.150, P=0.0003, R* = 0.90. 
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Extended Data Figure 7 | Percentage error in estimates of each tooth compared with the prediction surfaces in Fig. 2. Prediction surface is calculated 
so that m1 always has zero prediction error, therefore it is excluded from error calculations. a, Homo species; b, australopiths. 
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Extended Data Figure 8 | Detailed contour plot (contour step = 5 mm”) the measured size, then moving horizontally to the other tooth positions 
for prediction surfaces of hominin tooth size. Area of m1 and areas on (cyan line and crosses) to read off the sizes according to the contours. 
contour in mm”. Blue contours are for Homo species, red for australopiths. | When mean m1 size is 125 mm/?, dp3, dp4, m2 and m3 are 62, 93, 130 and 
From the mean size of one tooth position (for example, m1 at 125 mm’), 199 mm’? respectively for a Homo species and 50, 88, 156 and 158 mm? 
the mean sizes of the remaining four teeth in the row can be predicted by respectively for an australopith species. 
following the tooth position vertically (orange line) to meet the contour of 
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rodents in ref. 6. b, Relative size of third molar to first molar (m3/m1) 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


PropArea 


Extended Data Figure 10 | Planes and surfaces for equations of tooth 
position T (horizontal) versus area of m1 AM1 (into page) versus 
proportion of area or area (vertical). a, Regression plane A (cyan) 

and plane B (green) with proportion of area PropArea as calculated in 
equations 2 and 3 in Supplementary Information. b, Surfaces AreaAH 
(cyan) and AreaBH (green) as calculated in equations 11 and 12 in 
Supplementary Information. The two regions that represent the data are 
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plane A or AreaAH on the left of the intersection and plane B or AreaBH 
on the right of the intersection of the two planes or surfaces, respectively. 
c, Prediction of m1 area using formulae for AreaAH (cyan) when T=3 
compared with the expected 1:1 relationship (black) using equation 11 in 
the Supplementary Information. If the cyan formula were standardized by 
the expected value (black), the standardized surface will correctly predict 
ml size (equation 17 in the Supplementary Information). 
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Adult restoration of Shank3 expression rescues 
selective autistic-like phenotypes 


Yuan Mei!*, Patricia Monteiro!?**, Yang Zhou', Jin-Ah Kim!, Xian Gao!*, Zhanyan Ful? & Guoping Feng)? 


Because autism spectrum disorders are neurodevelopmental 
disorders and patients typically display symptoms before the age 
of three’, one of the key questions in autism research is whether 
the pathology is reversible in adults. Here we investigate the 
developmental requirement of Shank3 in mice, a prominent 
monogenic autism gene that is estimated to contribute to 
approximately 1% of all autism spectrum disorder cases”~°. 
SHANK: is a postsynaptic scaffold protein that regulates synaptic 
development, function and plasticity by orchestrating the assembly 
of postsynaptic density macromolecular signalling complex’~’. 
Disruptions of the Shank3 gene in mouse models have resulted 
in synaptic defects and autistic-like behaviours including anxiety, 
social interaction deficits, and repetitive behaviour!*"13. We 
generated a novel Shank3 conditional knock-in mouse model, 
and show that re-expression of the Shank3 gene in adult mice led 
to improvements in synaptic protein composition, spine density 
and neural function in the striatum. We also provide behavioural 
evidence that certain behavioural abnormalities including social 
interaction deficit and repetitive grooming behaviour could be 
rescued, while anxiety and motor coordination deficit could not be 
recovered in adulthood. Together, these results reveal the profound 
effect of post-developmental activation of Shank3 expression on 
neural function, and demonstrate a certain degree of continued 
plasticity in the adult diseased brain. 

SHANKS3 (also known as ProSAP2) protein is a major scaffolding 
protein at the excitatory synapse, coordinating the recruitment of many 
postsynaptic signalling molecules'*’°. Previous work has shown that 
germline deletion of Shank3 in mice disrupts the protein composition 
at the postsynaptic density (PSD), reduces neurotransmission efficiency 
and leads to autistic-like behaviour!” ?. 

To address whether adult reversal of physiological and behavioural 
abnormalities in Shank3 mutant mice is possible, we adopted a genetic 
method that allows for inducible Shank3 expression. Because Shank3 
duplication is linked to attention deficit hyperactivity disorder and 
bipolar disorder**"*, it is crucial to keep Shank3 expression within 
its physiological concentrations to avoid potential confounds. Thus, 
we generated a novel Shank3 conditional knock-in mouse by using 
the Cre-dependent genetic switch (FLEx) strategy’, which enables 
the conditional manipulation of the Shank3 gene at its endogenous 
genomic locus. 

We genetically targeted the PDZ domain owing to its role in assem- 
bling the scaffold complex in the PSD!*’* (Fig. 1a and Extended Data 
Fig. 1). In the absence of Cre, the Shank3* mice function as Shank3 
knockout (KO) mice, and result in the deletion of most major isoforms 
of SHANK3 including the putative a, 3 and y bands (Fig. 1b). Similar 
to abnormalities previously reported on Shank3 knockout mice’*!3, 
these Shank3** mice showed significant deficits in exploratory behav- 
iour (Fig. 1c), anxiety (Fig. le) and motor coordination (Fig. 1d), and 


displayed open wound lesions that suggested repetitive grooming 
behaviour!”!®. In addition, they showed impaired neurotransmission 
in the dorsal striatum (Fig. 1f, g). These novel Shank3*“* mice thus 
recapitulate autistic-like phenotypes, enabling us to investigate the 
possibility of reversing those deficits in adulthood. 

To achieve temporal control of Shank3 expression, we crossed the 
Shank3* mice to an inducible CAGGS-CreER mouse line’? that 
activates global Cre function after tamoxifen treatment (Extended Data 
Fig. 1d). When the Shank3"™:CreER*’~ mice reached 24.5 months, 
we used oral gavage to deliver tamoxifen to ensure efficient 
Cre-mediated gene restoration (Fig. 2a). All experiments were per- 
formed on the following three groups: Shank3*/’:CreER™'~ mice 
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Figure 1 | Shank3* mice have deficits in neurotransmission and 
behaviour. a, Domain structure of SHANK3 protein, with FLExed PDZ 
domain inverted, which can be re-oriented in the presence of Cre. 

b, Western blot showing SHANK3 expression in striatal PSD from wild-type 
(WT) and Shank3/** (KO) mice. c, KO mice show decreased total distance 
travelled in the open-field test compared to WT mice. d, KO mice show 
impaired motor coordination in the rotarod test. e, KO mice spend less 
time in the open arm during the elevated zero maze test. f, KO mice 

show a decreased pop spike amplitude in extracellular field recordings 

in the striatum. g, Normal relationship of stimulation intensity to the 
negative peak 1 (NP1) amplitude (action potential component), suggesting 
unaltered presynaptic function; insets show representative traces. 
*P<0.05, **P< 0.01, ***P < 0.001 (two-tailed t-test (c, left, e) and two-way 
repeated measures analysis of variance (ANOVA) with Bonferroni post- 
hoc test (c, right, d, f, g)). Data are mean + s.e.m. (all behaviour data from 
n=6 WT and n=7 KO mice; electrophysiology data from n = 9 slices; 

3 mice per genotype). See Supplementary Fig. 1 for gel source data. 
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Figure 2 | Rescue of PSD proteins and striatal neurotransmission. 

a, Experimental groups and TM feeding scheme. Expt, experiment; 

veh, vehicle. b, Western blot from striatal synaptosome preparation 

after tamoxifen feeding shows restoration of most SHANK3 isoforms. 

c, Western blots of striatal synaptosomal fractions (bottom); PSD 

protein levels in KO mice are restored to WT levels in the TM group 

(top). d, Striatal field response in KO mice was rescued in TM mice. 

e, Representative traces for WT, KO and TM mice. Indistinguishable 
relationship of stimulation intensity to the NP1 amplitude among groups 
suggests unaltered presynaptic function. f, Rescued mESPC frequency in 
the striatum in the TM compared to KO mice. g, Increased spine density in 
the TM compared to KO mice, while spine density in the KO mice is lower 
than that of the WT. *P < 0.05, **P < 0.01, ***P < 0.001 (Student’s two- 
tailed t-test (c), two-way repeated measures ANOVA with Bonferroni post- 
hoc test (d, e), one-way ANOVA with Bonferroni post-hoc test (f), one- 
way ANOVA with Newman-Keuls multiple comparison post-hoc test (g)). 
Data are mean +s.e.m. (c:n =3 WT, n=4 TM and n=3 KO; each sample 
represents combined striatal tissue from 2 mice; d and e: n = 12 slices 
from 4 WT, 4 TM and 4 KO mice; f: n= 23 WT, n= 26 KO and n=27 TM 
cells; g:n = 32 WT, n=30 KO and n= 40 TM dendritic segments). See 
Supplementary Fig. 1 for gel source data. 


treated with tamoxifen (wild type, WT), Shank3"*:CreER*/~ mice 
treated with tamoxifen (TM), and Shank3"":CreER*’— mice treated 
with corn oil vehicle (KO). Synaptosomal preparations showed that 
tamoxifen treatment restored the expression of most major SHANK3 
isoforms, including the a and 8 isoforms, to the wild-type level at 
synapses (Fig. 2b). 

Because SHANK3 is the only SHANK protein family member highly 
enriched in the striatum, a region strongly implicated in certain behav- 
iours associated with autism!*!8, mechanistic characterizations were 
primarily carried out on striatal neurons. We found that the synaptic 
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Figure 3 | Adult Shank3 expression rescued repetitive grooming 

and social interaction. a, b, Significantly reduced repetitive grooming 
behaviour in TM compared to KO mice. c, Representative heat maps from 
the social interaction test for all groups. d, Unlike WT mice, KO mice 
showed no preference for social interaction (SI) with a stranger (S) mouse 
rather than a novel object (O); this behaviour is rescued in the TM group. 
*P<0.05, **P < 0.01, ***P < 0.001 (one-way repeated measures ANOVA 
with Bonferroni post-hoc test (a, b, d, left), Kruskal-Wallis test with 
Dunn’s multiple comparison test owing to non-normal distribution 

(d, right)). Data are mean + s.e.m. (a and b:n=9 WT, n=9 TM and 

n= 12 KO mice; d: n= 22 WT, n=30 TM and n=30 KO mice). 
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concentrations of scaffold proteins SAPAP3, Homer1b/c and glutamate 
receptor subunits NR2A, NR2B and GluA2 were significantly reduced 
in KO striatal synaptosomal preparations. However, in the TM condi- 
tion, the synaptic levels of these postsynaptic proteins were significantly 
increased compared to those in the KO mice, reaching comparable 
levels to the WT (Fig. 2c). This molecular alteration shows for the first 
time, to our knowledge, that restoring Shank3 expression in the adult 
brain can efficiently recruit major scaffolding and signalling proteins 
to the synapse and assemble the PSD protein network even after the 
developmental period. 

To investigate whether there were functional changes parallel to 
molecular repair at synapses, we examined striatal physiology. As 
expected, the KO mice showed significantly reduced field population 
spikes. However, the reduced field responses were rescued to WT 
levels in the TM condition (Fig. 2d, e). We then performed whole- 
cell recordings and found that the KO mice had reduced miniature 
excitatory postsynaptic current (mEPSC) frequency compared to 
WT mice. Notably, mEPSC frequency was rescued in the TM con- 
dition to comparable levels of the WT group (Fig. 2f). We also 
measured the paired-pulse ratio, the AMPAR/NMDAR ratio (a-amino- 
3-hydroxy-5-methyl-4-isoxazole propionic acid receptor/N-methyl-p- 
aspartate receptor) (Extended Data Figs 2 and 3), and NR2B (also 
known as GRIN2B)/total NMDAR ratio (Extended Data Fig. 2c). We 
found no differences across genotypes. Overall, these results suggest 
that a primary defect of striatal physiology in KO mice is the reduced 
mEPSC frequency, and this defect can be improved by restoring 
SHANKS expression in adulthood. 

Previous studies have shown that Shank3 manipulation can lead to 
significant changes in the total spine density®””. It is, however, unclear 
whether post-developmental Shank3 expression can also affect spine 
structure. To address this, we used systemic viral injection to sparsely 
label neurons with green fluorescdent protein (GFP) in the WT, KO 
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Figure 4 | Restoring Shank3 expression in adulthood did not 

rescue anxiety and rotarod deficits. a, b, Open-field tests indicated 

that SHANK3 re-expression in adults (TM) does not rescue reduced 
locomotion and reduced rearing (axiogenic behaviour) in KO mice. NS, 
not significant. c, KO mice spend less time exploring the open arm during 
the elevated zero maze test; this behaviour is also not rescued in TM group. 
d, Motor coordination measurement from rotarod is not rescued in TM 
group. e-h, Germline rescued (GR) Shank3** mice show that all above 
parameters for open-field, elevated zero maze and rotarod tests can be 
rescued if Shank3 expression is restored at the germ-cell stage. *P < 0.05, 
**P < 0.01, ***P < 0.001 (one-way ANOVA (a, left, c), Kruskal-Wallis 
test with Dunn’s multiple comparisons (b, left), two-tailed t-test (e, left, 

f, g, left); two-way repeated measures ANOVA with Bonferroni post-hoc 
test (a, right, b, right, d, e, right, g, right, h). Data are mean +s.e.m. (a-c: 
n=18 WT, n=25 TM and n=27 KO mice; d: n=13 WT, n=19 TM and 
n= 21 KO mice; e-h:n=10 WT and n=8 GR mice). 


and TM mice and analysed their dendritic spines in the dorsal striatum. 
Similar to our previous report on Shank3B (a Shank3 isoform) KO 
mice’’, we observed a significant reduction of medium spiny neuron 
(MSN) spine density in the KO compared to WT mice. Surprisingly, 
the total spine density was significantly increased in the TM-treated 
mice compared to the KO mice (Fig. 2g). To our knowledge, this is the 
first indication that the ability of Shank3 to promote spine formation 
or maintenance is not restricted to any developmentally critical period, 
highlighting the continued structural plasticity in the adult striatum. 
Interestingly, the spine density in the TM condition is significantly 
higher than that of the WT. One possibility for this phenomenon is 
that adult Shank3 expression induces dendritic spines that may not be 
pruned and regulated by the same processes that occur in development. 

We next tested how adult SHANK3 expression may affect behav- 
ioural abnormalities. Because striatal defects have been strongly linked 
to repetitive and compulsive behaviours”), and Shank3 KO mice show 
signs of overgrooming!”, we filmed WT, TM and KO mice after treat- 
ment, and quantified their grooming time. The results indicated that 
while there was a significant increase in the percentage of time spent 
grooming in the KO mice compared to WT mice, TM mice exhibited a 
significantly reduced grooming time (Fig. 3a, b). In addition, during the 
tamoxifen treatment, we noticed that some Shank3**:CreER*/~ mice 
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that had initially developed lesions began to heal and regrow their lost 
fur, providing further support that the repetitive/excessive grooming 
phenotype is reversible in the Shank3** mice. 

One of the defining features of autism is the impairment in social 
interaction. Thus, we used a modified three-chamber assay to probe 
voluntary social interaction’*'3”, After habituating to the three-chamber 
box, the test mice were given a choice of interacting with either a novel 
object or a stranger mouse. We found that although WT mice demon- 
strated a strong preference for the stranger mouse compared with 
the novel object, KO mice displayed no such preference (Fig. 3c, d). 
Notably, we found that TM mice behaved similarly to the WT controls 
in that they showed a strong preference for the stranger mouse in both 
the duration and frequency of the interaction (Fig. 3c, d). These data 
show that, similar to the repetitive grooming behaviour, the social inter- 
action deficit can also be rescued in adulthood. Currently, the neuro- 
biological mechanisms of social deficits of autism spectrum disorder 
are not well understood. Although ventral striatum has been associated 
with social interaction’, we did not observe differences in the measure- 
ments of mEPSCs, the AMPAR/NMDAR ratio or the paired-pulse ratio 
in the ventral striatum/nucleus accumbens across genotypes (Extended 
Data Fig. 4). 

Encouraged by the rescue of repetitive grooming and social defi- 
cits, we ran a range of other tests to assess the extent of behavioural 
rescue. In contrast to the social and grooming behaviours, adult Shank3 
re-expression has minimal effect on reduced locomotion, anxiety-like 
behaviour and motor coordination deficits (Fig. 4). In the open-field 
test, TM mice showed no significant difference from the KO mice in 
exploratory behaviour (Fig. 4a) and anxiety-like behaviour including 
rearing time and rearing frequency (Fig. 4b). This result was further 
corroborated by our observations from the elevated zero maze test 
(Fig. 4c). In addition, we found no significant recovery in the motor 
coordination deficit (Fig. 4d). 

Because the cortex and cerebellum have been linked to anxiety and 
motor coordination, we assessed the mechanistic changes in these 
brain regions”*”°. In both regions, SHANK3 expression was restored 
to the WT level in TM mice (Extended Data Figs 5 and 6). However, 
we found minimal differences across the genotypes (Extended Data 
Figs 5-7). Thus, the circuit mechanisms of anxiety and motor coordi- 
nation deficits in the Shank3 KO mice may involve other brain regions 
or could be the collective consequence of connectivity deficits across 
many brain regions. 

To address whether the lack of rescue in these behaviour phenotypes 
is indeed due to missing SHANK3 expression during development, we 
crossed Shank3** mice to 8-actin-Cre mice, enabling germline restora- 
tion of Shank3. The germline rescue mice showed no significant differ- 
ences in striatal physiology (Extended Data Figs 4 and 8), exploratory 
behaviour (Fig. 4e), anxiety (Fig. 4f, g) or motor coordination (Fig. 4h) 
compared to their wild-type littermates, indicating that restoring 
Shank3 expression at the germ-cell stage can restore all behavioural 
phenotypes. Together, these data strongly suggest that there are certain 
developmental defects that are irreversible by adult re-expression of 
Shank3. 

To test whether there is a developmental window for rescuing anxiety 
and motor deficits, we treated postnatal day (P) 20-21 mice with 
tamoxifen to induce Shank3 expression efficiently (Extended Data 
Fig. 9a), and assayed their behaviours in adulthood (Extended Data 
Figs 9 and 10). As previously reported”*, we observed that the treatment 
of young mice with tamoxifen leads to some toxicity including reduced 
body weights in the wild-type mice, and affected their motor coordi- 
nation compared to the KO mice fed with corn oil (Extended Data 
Fig. 10d). However, even with the TM toxicity, we found that on the 
rotarod, the developmentally treated TM mice performed significantly 
better than their KO counterparts (Extended Data Fig. 10a, b). The 
P20-P21-treated TM mice also showed significantly reduced anxiety 
on the elevated zero maze compared to the KO mice (Extended Data 
Fig. 9f). Overall, these behavioural results show that earlier intervention 
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yields more behavioural improvement than adult treatment, further 
supporting the developmental origin of the behavioural abnormalities 
that are irreversible in adults. 

We have shown that in the adult mouse brain, Shank3 expression can 
increase dendritic spine density, restore the PSD protein composition, 
and improve striatal neurotransmission. Although adult gene rescue 
characterizations have been performed on other autism genes?”?8, this 
is the first report, to our knowledge, to show selective rescue in specifi- 
cally autism-related behavioural phenotypes. Notably, we also demon- 
strated that the behavioural deficits that are irreversible in the adult can 
be improved by early postnatal intervention. Our results highlight the 
unique behavioural effect of SHANK3 expression after development, 
and underscore the surprising extent of continued neural plasticity in 
the adult brain. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Generation of Shank3 conditional knock-in (fx/fx) mice. The Shank3*“* 
targeting vector was designed by inverting the PDZ domain (exons 13-16) 
and flanking it with the FLEx cassette, which is composed of one pair of 
loxP sites staggered with one pair of lox2722 sites. Shank3*“* conditional 
knock-in mice were generated by homologous recombination in Rl embry- 
onic stem cells and implanting the correctly targeted cells into C57 blasto- 
cysts using standard procedures. Correct locus insertion of the targeting 
construct into the genomic DNA was determined by PCR genotyping using 
two primers End_F (5‘-GGCAGACTCCACACAGTTCCTG-3’) and LoxR 
(5'-GTATCCTATACGAAGTTATTCCGGGTCGAC-3’). Subsequent mouse 
genotyping was determined by PCR of mouse tail or ear DNA using three primers. 
For the WT allele, primer FuncF2 (5’-CGTTTGACACACATAAGCACC-3’) 
and primer FuncFlipR4 (5’-CTCCACCTAGCTGAATTTCCC-3’) were used 
to produce a band of 340 base pairs (bp). For the knockout (Fx) allele, primer 
FuncF2 (5’/-CGTTTGACACACATAAGCACC-3’) and primer Gen_Flx_R1 
(5'‘-GCTGACATCACATTGCTGCC-3’) were used to produce a band of 481 bp. 
For the rescue allele, primer FuncF2 (5’-CGTTTGACACACATAAGCACC-3’) 
and primer FuncFlipR4 (5’-CTCCACCTAGCTGAATTTCCC-3’) were used to 
produce a band of 408 bp. 

Chimaeric males were crossed to C57BL/6J females from Jackson Labs. The 
F, hybrids were crossed with C57BL/6] (-actin Flp to remove the neomycin cas- 
sette. All progeny were bred onto the pure C57BL/6] (Jackson Labs) for at least 
two generations before being bred onto a mixed background with 129S1/SvlmJ 
(Jackson Labs). Heterozygotes were initially bred with heterozygotes to produce 
experimental animals. All germline Shank3"* (KO) and germline rescue (GR) 
mice along with their respective wild-type littermates were produced by breeding 
heterozygotes with heterozygotes. For the adult Shank3 rescue experiments, the 
Shank3 conditional knock-in line was crossed with CAGGS-CreER™. To produce 
enough animals for all necessary experiments, breeding strategy was switched 
to heterozygotes crossed with homozygotes and homozygotes crossed with 
homozygotes for all conditions (Shank3!*:CreER™ ~ bred with Shank3":CreER; 
Shank3®:CreER*’— bred with Shank3™":CreER ~~; Shank3*’+:CreER*’— bred with 
Shank3*/*:CreER 7; Shank3*/~:CreER™— bred with Shank3“*:CreER ‘). It should 
be noted that all animals in the rescue condition were produced from the same litters 
as the animals in the knockout condition. The animals were randomly assigned to 
different conditions. No computerized randomization program was used. 

Animals were housed by genotype at a constant 23°C in a 12-h light/dark cycle 
(lights on at 07:00, lights off at 19:00) with ad libitum food and water. Rescue 
treatment, that is tamoxifen feeding, was initiated on mice at 2-4.5 months. All 
electrophysiological and behavioural experiments were done at least 6 weeks after 
treatment in adult mice with the experimenter being blinded to the genotypes. 
Only age-matched male mice were used for behavioural assays. All experimental 
procedures were inspected and approved by the MIT Committee on Animal Care. 
Open-field test. An automated Omnitech Digiscan apparatus (AccuScan 
Instruments) was used to assess spontaneous locomotion as previously described". 
Anxiety-like behaviours were assessed with the following parameters: time spent 
rearing and frequency of rearing. Locomotion was evaluated by the total distance 
travelled. The first 30 min were evaluated for all parameters. Statistical analysis was 
done using one-way ANOVA with Bonferroni multiple comparison tests. 
Elevated zero maze test. An elevated zero maze was illuminated such that the open 
arm was lit by 60 Lx, and the dark arm was lit by 10-20 Lx. Animals were habituated 
with 10-20 Lx for at least 1h before the test. The animal was introduced into the 
closed arm and allowed to explore the maze freely for 5 min, and this was filmed. 
An observer blinded to the genotype performed analysis using the automated 
tracking software Noldus Ethovision. Anxiety-like behaviour was assessed as the 
percentage of time spent by the animal in the open arm during the 5-min interval. 
Statistical analysis was done using one-way ANOVA with Bonferroni multiple 
comparison tests. 

Rotarod test. Animals were placed on a rotarod apparatus (Med Associates) that 
accelerates 4—40 r.p.m. for 5 min. Each animal was tested for three trials with 1-2h 
between trials in a single day. All trials were filmed. Latency to fall was manually 
analysed for each trial using Noldus Observer software by an observer blinded to 
genotypes. The change in the latency to fall over the course of three trails indicates 
the quality of motor coordination. Statistical analysis was done using two-way 
repeated measures ANOVA with Bonferroni post-hoc tests. 

Grooming. Animals were individually placed into a new cage and allowed to 
habituate. Grooming behaviour was filmed for 2h from 19:00 to 21:00 with red 
light (2 Lx). An observer blinded to the genotype manually quantified grooming 
behaviour using Noldus Observer. All instances of face-wiping, scratching/rubbing 
of head and ears, and full-body grooming were counted as grooming behaviour. 
Statistical analysis was done using one-way ANOVA with Bonferroni multiple 
comparison tests. 
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Social interaction test. A modified version of the three-chamber social interaction 
assay was used as previously described!*!>?, Only age-matched males were used 
for all tests. S129 males were used as stranger mice and were habituated to the test 
chamber for three sessions (20 min each), 1 or 2 days before the behavioural assay. 
On the day of the test, both test and stranger animals were habituated to the test 
room for at least 1h before the start of the assay. The left and right chambers of 
the three-chamber apparatus were both lit by 4-6 Lx during the test session. Each 
test animal was first placed into the centre chamber with open access to both the 
left and right chamber, each of which contained an empty wired cup placed upside 
down. This allowed the animal to habituate to not only the social apparatus, but 
also to the cups that will eventually contain the stranger mice. After 15 min of habit- 
uation, the test animal was moved back to the centre chamber briefly before the 
next session. During the social phase, an age-matched stranger mouse was placed 
randomly into one of the two side chambers while a novel object was placed into 
the other side chamber. The test animal was allowed to explore the social apparatus 
freely and demonstrate whether it prefers to interact with the novel object or novel 
mouse. This social phase was also for 15 min. The placement of the stranger mouse 
and the object was alternated between test mice to eliminate any confounds due 
to chamber bias. Time spent by the test animal in close proximity (~5 cm) to the 
cup containing either the stranger or the object was calculated. Analysis was done 
by an observer blinded to the genotype on Noldus Ethovision. One-way ANOVA 
with Bonferroni post-hoc test was used for statistical analysis. 

Tamoxifen preparation and feeding. Tamoxifen (Sigma, T5648) was dissolved 
in corn oil at 20mg ml! by vortexing. Freshly prepared tamoxifen was protected 
from light by aluminium foil and kept for 2-3 days at room temperature. Animal 
feeding needles from Harvard Apparatus (52-4025) were used for oral gavage. To 
avoid toxicity of tamoxifen, the following dosages were used for adult animals: mice 
at 17-21 g body weight were fed 5 mg day |; mice at 22-25 g body weight were fed 
6mg day '; mice at 26-29 g body weight were fed 7 mg day _'; mice at 30-35 g body 
weight were fed 8mg day |. 

The adult animals were fed for 5 consecutive days followed by 2 weeks of rest. 
The animals were then fed for 5 more consecutive days followed by another 2 weeks 
of rest. Corn oil was used as a control. The mice fed with tamoxifen and with corn 
oil were housed separately to avoid contamination. 

For induction of SHANK3 expression in P20-P21 animals, mice that weighed 
7-9 g received 0.1 ml tamoxifen or corn oil per day for 2-3 consecutive days. 
Mice that weighed 10-12 g received 0.15 ml tamoxifen or coil per day for 2-3 
consecutive days. 

Western blot. PSD and synaptosomal fractions of the striatum, cortex and 
cerebellum were prepared as previously described”'. Purified fractions were 
separated on SDS-PAGE and quantified using Odyssey Licor. $-Actin and tubulin 
were used as loading controls. Specific primary antibody for SAPAP3 was prepared 
as previously described”!. Commercial antibodies used include SHANK3 (Santa 
Cruz, SC-30193), GluR1 (Millipore, MAB2263), GluR2 (Neuromab, 75-002), NRI 
(BD Biosciences, 556308), NR2A (Millipore, 07-632), NR2B (Millipore, 05-920), 
Homerl1 (Chemicon, AB5877, Synaptic Systems, 160022), Homer3 (Synaptic 
Systems, 160303), mGLURS (Abcam, ab76316), CaMKIlIa (Millipore, 05-532), 
SHANKI (Synaptic Systems, 162002), SHANK2 (Cell Signaling, 122188), B-actin 
(Sigma, A5441) and tubulin (Sigma, T5168). Statistical analysis was done using 
two-tailed Student’s t-tests. 

Dendritic spine analysis. Age-matched male mice from WT (n=5), KO (n=4, 
1 had to be euthanized owing to lesion development) and TM (n=5) conditions 
at 6-12 months old were used. The pAAV-hSYN1-EGFP-P2A-EGFPf-WPRE- 
HGHpaA construct was cloned and sent for commercial viral packaging by 
Upenn Viral Core with serotype 2/9. Farnesylated GFP (GFPf) was previously 
shown to target the membrane”. To achieve sparse labelling, ~8—30 1] of this 
virus with titre of 4.04 x 10'? genome copies (GC) ml! was injected through the 
retro-orbital route into each mouse. Three weeks after viral injection, the mice were 
transcardially perfused with 4% paraformaldehyde and sectioned into 200-,1m 
slices. Immunohistochemistry was performed by staining the slices with an 
anti-GFP antibody (Invitrogen, Al1122) for 48h and 24h of secondary antibody 
incubation. The stained slices were then surrounded by a 240-|1m depth spacer 
(Electron Microscopy Sciences) and mounted with Vectashield Mounting Media. 

Confocal images were taken with a x60 objective of the dorsal striatum. Spine 
count on intact neurons began 30-40 1m away from the soma and was extended 
for 10-60 1m from the origin. Spine density was analysed automatically by Neuron 
Studio. All virus injections, imaging and software analysis were done with the 
experimenter blinded to the mouse genotypes. Statistical analysis was done using 
one-way ANOVA, Newman-Keuls post-hoc test. 

Electrophysiology slices, dorsal and ventral striatum. Acute striatal slices were 
prepared from 3-7-month-old age-matched mice. Animals were anaesthetized by 
avertin intraperitoneal injection (tribromoethanol, 20mgmlI |, 0.5mgg_! body 
weight) and transcardially perfused with ice-cold oxygenated NMDG-based 
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cutting aCSF solution (mM): 92 N-methyl-p-glucamine (NMDG), 2.5 KCl, 
1.20 NaH4POug, 30 NaHCO3, 20 HEPES, 25 glucose, 2 thiourea, 5 Na-ascorbate, 
3 Na-pyruvate, 0.5 CaCl, and 10 MgSO, (~300 mOsm, 7.2-7.4 pH). After decapi- 
tation, brains were removed for sectioning in the same ice-cold cutting aCSF using 
a Vibratome 1200S (Leica Microsystems). For all dorsal striatal recordings, 300-j1m 
coronal slices were prepared, unless otherwise stated (Extended Data Fig. 3). 
For all nucleus accumbens (NAc) core recordings, 300-|1m parasagittal slices were 
prepared and NAc core was identified by the presence of anterior commissure. 
Slices were recovered in the same cutting aCSF solution at 32°C for 12 min and 
transferred to room-temperature carbogenated regular aCSF (mM): 119 NaCl, 
2.5 KCl, 1.2 NaH2POx, 24NaHCOs, 12.5 glucose, 2MgSO4.7H2O and 2 CaCl).2H2,O 
(~300 mOsm, 7.2-7.4 pH). Slices were allowed to recover at least > 1h and trans- 
ferred to a recording chamber (RC-27L, Warner Instruments) before recordings. 
Stimulations were performed using a platinum iridium concentric bipolar electrode 
(CBAPC75, FHC). For dorsolateral striatum coronal slices, electrode was placed 
at the corpus callosum to mainly stimulate corticostriatal axons. For dorsolateral 
striatum parasagittal slices, stimulating electrode was placed at the cortex between 
layer V and VI. For ventral striatum parasagittal slices, electrode was placed dor- 
sally to the anterior commissure at the border between NAc core and the cortex. 
Afferents were stimulated with 0.1 ms stimulation step (Isoflex, AMPI) deliv- 
ered at 0.05 Hz frequency (unless otherwise stated). Slices were visualized under 
IR-DIC (infrared-differential interference contrast) using a BX-51 WI microscope 
(Olympus). All slice preparations, recordings and data analysis were performed 
with experimenter blinded to the genotypes. 

Medial prefrontal cortex. Acute brain slices were prepared from 3-7-month-old 
mice as follows. In brief, mice were deeply anaesthetized by intraperitoneal 
injection of avertin solution (20mg ml ', 0.5 mg g_! body weight) and then 
transcardially perfused with 20 ml of carbogenated (95% O, 5% COs) ice-cold 
cutting solution with the composition (in mM): 105 NMDG, 105 HCl, 2.5 KCl, 
1.2 NaH2PO4, 26 NaHCOs, 25 glucose, 10 MgSO4, 0.5 CaCl, 1 L-ascorbic acid, 
3 sodium pyruvate and 2 thiourea (pH 7.4, with osmolarity of 300-310 mOsm). 
The brains were rapidly removed and placed in ice-cold and oxygenated cut- 
ting solution. Coronal slices (300 1m) were sliced using Leica VT 1200S (Leica 
Microsystems) and then transferred to recovery chamber at 32°C with carbogen- 
ated cutting solution for 8 min, followed by transferring to holding chamber con- 
taining aCSF that contained (mM): 119 NaCl, 2.3 KCI, 1.0 NaH POx, 26 NaHCO3, 
11 glucose, 1.3 MgSO, and 2.5 CaCl; (pH was adjusted to 7.4 with HCl, with osmo- 
larity of 300-310 mOsm) at room temperature. Slices were allowed to recover for 
at least 2h in holding chamber before recording and used for experiment typically 
between 3 and 7h after slicing. Layer 5 pyramidal cells with a prominent apical 
dendrite were visually identified with a microscope equipped with IR-DIC optics 
(BX-51WI, Olympus) mainly by location, shape and pClampex online membrane 
test parameters. 

Extracellular field recordings. Slices were prepared from a daily group of mice 
containing all the three genotypes/treatments on randomized order, and recordings 
were obtained at room temperature with carbogenated regular aCSF (~2ml min! 
rate). Borosilicate glass recording microelectrodes (King Precision Glass) were 
pulled on a P-97 horizontal puller (Sutter Instruments) and backfilled with 2M 
NaCl. Recording electrode was placed ~400 1m away from stimulating electrode 
and field population-spike was evoked by a 0.1 ms stimulation step (Isoflex, AMPI) 
delivered at 0.05 Hz frequency. Input-output functions were generated through 
consecutive rounds from 0.1-1.0mA in 0.1 mA increments (triplicate measure- 
ments per stimulation intensity). Three components were resolved in the recording 
traces: stimulation artefact, negative peak 1 (NP1, presynaptic fibre volley) and field 
population spike. Amplitude for each component was determined by the average 
peak amplitude from triplicated measurements per stimulation intensity. Data were 
amplified using a MultiClamp 700B and sampled at 10 kHz using a Digidata 1440A 
acquisition system. Analysis was performed blinded to genotype using pCLAMP 
10 software (Axon Instruments/Molecular Devices). 

Whole-cell recordings. Borosilicate glass recording microelectrodes (King 
Precision Glass) were pulled on a P-97 horizontal puller (Sutter Instruments) 


and backfilled with CsGlu (mM): 110 CsOH, 110 p-gluconic acid, 15 KCI, 4 NaCl, 
5 TEA-Cl, 20 HEPES, 0.2 EGTA, 5 lidocaine N-ethyl chloride, 4 MgATP and 
0.3 NaGTP. Internal pH was adjusted to ~7.3 with KOH and osmolarity adjusted 
to ~300 mOsm with K,SO4. Typical internal resistance was around 3-5 MQ. MSNs 
were visually identified based on their shape, size and location under IR-DIC, 
using a BX-51WI microscope (Olympus). After seal rupture and internal equi- 
librium (5 min to allow proper dialysis of Cs* internal), cells were recorded with 
series-resistance values <20 MQ (recording traces were excluded for data analysis 
if series resistance changed by >20%). All voltage clamp traces were recorded in 
the presence of 501M picrotoxin (Tocris) with theoretical liquid junction poten- 
tial not corrected for. Signals were filtered at 2 kHz, digitized at 10 kHz and data 
acquired using a MultiClamp 700B amplifier and a Digidata 1440A. All analyses 
were performed blinded to the genotype. Grubb’s test was used to remove single 
outliers. 

mEPSCs. Slices were perfused with room temperature carbogenated regular aCSF 
at a rate of approximately 2ml min !. Voltage clamp traces were recorded at hold- 
ing potential —70 mV in the presence of 50|1M picrotoxin (Tocris), 501M pL-AP5 
(pL-2-amino-5-phosphonovaleric acid, Abcam) and 11M tetrodotoxin (Tocris). 
Analysis of mEPSCs was performed using pCLAMP10 (Axon Instruments, 
Molecular Devices) and Minianalysis software (Synaptosoft) by manually click- 
ing of individual events. 

Paired-pulse ratios. Slices were perfused with carbogenated regular aCSF at 
~30°C, ~2ml min“! rate. Stimulus intensity was set to evoke 150-400 pA EPSCs 
at holding potential —70 mV. Two EPSCs (inter-stimulus interval of 50 ms) were 
evoked for 10 consecutive traces. The paired-pulse ratio was calculated by dividing 
the second EPSC peak amplitude by the first one. 

AMPAR/NMDAR ratios. Slices were perfused with carbogenated regular aCSF 
at ~30°C, ~2 ml min“! rate. Stimulus intensity was set to evoke 150-400 pA 
EPSCs at holding potential —70 mV. An average EPSC was obtained (~10 traces) 
at —70mV and then at +40 mV. The AMPAR/NMDAR ratio was calculated as 
the ratio of the average EPSC peak amplitude at -70 mV (AMPAR EPSC) to the 
average amplitude of the EPSC recorded at +40 mV, 50 ms after afferent stimula- 
tion. For pharmacologically isolated AMPAR/NMDAR ratios, dual-component 
evoked EPSCs at +40 mV was recorded before and after pL-AP5 bath application 
(501M, Abcam). NMDAR EPSC was obtained by digitally subtracting the aver- 
age EPSC amplitude after pL-AP5 application (AMPAR EPSC). For NR2B/total 
NMDAR ratio, dual-component evoked EPSC at +40 mV was recorded before and 
after ifenprodil bath application (31M, NR2B antagonist, Tocris) and ifenprodil 
blockable fraction was defined as NR2B EPSCs. pL-AP5 (50,1M) was then applied 
to block all NMDARs for measuring total NUDAR EPSCs. The fraction of NR2B 
containing NMDAR EPSCs is presented as the percentage of the total NUDAR 
EPSCs (NR2B/total NMDAR ratio = ifenprodil sensitive current divided by 
pL-AP5 sensitive current). 

Statistical analyses. All statistical analyses were performed using Prism (GraphPad 
Software). All data sets were analysed using D’Agostino—Pearson omnibus test and 
Shapiro-Wilk test for normality. Data sets with normal distributions were ana- 
lysed for significance using either unpaired Student's two-tailed t-test or ANOVA 
measures with multiple comparison post-hoc test, using *P < 0.05, **P< 0.01, 
** P< (0.001; all data presented as mean + s.e.m. Data sets with non-normal 
distributions were analysed using Kruskal-Wallis test with adjustments for multiple 
comparisons. On the basis of previously published SHANK3 models!” 3, we chose 
similar sample sizes for all experiments performed. All behaviour test results for the 
tamoxifen-treated cohorts are the combination of at least two different large animal 
cohorts that showed the same results. All electrophysiological, biochemical and 
morphological data were obtained by counterbalancing experimental conditions 
with controls. Further details on particular statistical analyses can be found on the 
respective figures/results section for each data set. 


29. Hancock, J. F., Cadwallader, K., Paterson, H. & Marshall, C. J. A CAAX or a CAAL 
motif and a second signal are sufficient for plasma-membrane targeting of ras 
proteins. EMBO J. 10, 4033-4039 (1991). 
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Extended Data Figure 1 | Generation of Shank3 conditional knock-in 
mice. a, i-ii, Schematic of the Shank3 gene and the target locus. 

a, iii, Targeted exons 13-16. a, iv, Neo-cassette excision via breeding with 
germline Flp mice. a, v, Exons 13-16 re-inversion via Cre-expressing mice. 
b, PCR genotyping showing the bands for Fx (knockout), rescue and WT. 
c, Western blot showing rescue of Shank3 expression upon germline Cre 
recombination, with the exception of the putative SHANK3 1 isoform. 
This is probably due to the disruption of a putative intronic promoter by 


2mm 


the introduction of the loxP sites. d, Tamoxifen-inducible Cre strategy 
leads to broad reporter expression. Sagittal sections from pCAGGS- 
CreER*/~:Rosa-floxstop-tdTomato*/~ mice after feeding with tamoxifen 
(left) or corn oil (right). Results show widespread induction of tdTomato 
reporter expression after tamoxifen-induced Cre activation, but not in 
the absence of Cre activity (corn oil feeding); the pCAGGS promoter 
consists of the CMV early enhancer with chicken $-actin promoter’. 
See Supplementary Fig. 1 for gel source data. 
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Extended Data Figure 2 | Additional measurements of dorsal striatum 
synaptic function in TM rescue. a, Representative traces (left) and bar 
graph (right) for the AMPAR/NMDAR ratio in WT, KO and TM groups 
(n=30 WT, n=33 KO and n= 33 TM MSNs). AMPAR/NMDAR ratio 
calculated as the ratio of the EPSC peak amplitude at -70 mV (AMPAR 
EPSC) to the amplitude of the EPSC recorded at +40 mV, 50 ms after 
afferent stimulation. b, Representative traces (left) and bar graph (right) 
for pharmacologically isolated AMPAR/NMDAR ratio (n = 20 WT, 
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n=20 KO and n=23 TM MSNs). Dual-component evoked EPSC at 

+40 mV recorded before and after AP5 bath application. c, Representative 
traces (left) and bar graph (right) for the NR2B/NMDAR ratio in WT, 

KO and TM groups (n= 16 WT, n= 16 KOand n=22 TM MSNs). Dual- 
component evoked EPSC at +40 mV recorded before and after ifenprodil 
bath application. N.S., not significant (Kruskal-Wallis test, with Dunn’s 
multiple comparison test (a), one-way ANOVA with Bonferroni post-hoc 
test (b, c)). Data are mean +s.e.m. 
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Extended Data Figure 3 | Additional measurements of synaptic n=10TM MSNs) slices. One-way ANOVA with Bonferroni post-hoc 
function in dorsal striatum by cortical and corpus callosum evoked test. c, Representative traces (right) and summary bar graphs of AMPAR/ 
stimulation. a, IR-DIC images showing representative placement of the NMDAR ratios evoked in parasagittal (left; n = 17 WT, n= 15 KOand 
stimulation electrode in the cortex (left) or corpus callosum (cc; right) n=16 TM MSNs) and coronal (right; n= 20 WT, n=16 KOandn=15 
to evoke EPSCs in dorsal striatum, using parasagittal (left) and coronal TM MSNs) slices from all three genotypes. AMPAR/NMDAR ratio 
(right) slices. b, Representative traces (right) and summary bar graphs calculated as the ratio of the EPSC peak amplitude at —70 mV (AMPAR 
for the paired-pulse ratios (PPR) of evoked EPSCs in dorsal striatum EPSC) to the amplitude of the EPSC recorded at +40 mV, 50 ms after 
MSNs, showing similar magnitude in parasagittal (left; n =23 WT ,n=20 afferent stimulation (c, right). Kruskal-Wallis test, with Dunn’s multiple 
KOand n=20 TM MSNs) and coronal (right; n =9 WT, n= 11 KOand comparison test. Data are mean + s.e.m. 
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Extended Data Figure 4 | Whole-cell measurements of excitatory 
synaptic function in nucleus accumbens. a, Representative traces 
(top) and summary bar graphs (bottom) of mEPSCs in nucleus 
accumbens (NAc) MSNs (1 = 22 WT, n= 16 KO and n= 18 GR MSNs). 
b, Representative traces (top) and summary bar graph (bottom) of the 
AMPAR/NMDAR ratio in NAc MSNs. The AMPAR/NMDAR ratio was 
calculated as the ratio of the EPSC peak amplitude at —70 mV (AMPAR 
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(o>) foe) ro) 
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EPSC) to the amplitude of the EPSC recorded at +40 mV, 50 ms after 
afferent stimulation (n= 19 WT, n= 16 KOand n=15 GR MSNs). 

c, Representative traces (top) and summary bar graph (bottom) of paired- 
pulse ratios in NAc MSNs (n= 25 WT, n= 22 KO andn=19 GR MSNs). 
One-way ANOVA Bonferroni post-hoc test (a, c), Kruskal-Wallis test, 
with Dunn’s multiple comparison test (b). Data are mean + s.e.m. 
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Extended Data Figure 5 | Western blots of synaptosomal preparations receptors in the adult cortex across genotypes. c, Quantification of 
from the cortex of the adult treated mice show minimal difference multiple synaptic proteins in the cortex of the adult treated mice. All 
across genotypes. a, Representative western blot of SHANK3 in the data, n=3 WT, n=3 TM and n=3 KO mice; each sample is from tissue 
cortex of adult WT, KO and TM mice, showing that most major SHANK3 taken from two animals. Student’s two-tailed unpaired t-test. Data are 
isoforms are restored in the TM mice. b, Representative western blots of mean + s.e.m. See Supplementary Fig. 1 for gel source data. 
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Extended Data Figure 6 | Western blots of synaptosomal preparations c, Quantification of multiple synaptic proteins in the cerebellum of the 
from the cerebellum of the adult treated mice show minimal difference adult treated mice. All data, n =3 WT, n=3 TM and n=3 KO mice; each 
across genotypes. a, Representative western blot of SHANK3 in the sample is from tissue taken from a single animal. Student’s two-tailed 
cerebellum of the adult mice treated with TM, showing that SHANK3 unpaired t-test. Data are as mean + s.e.m. See Supplementary Fig. 1 for gel 
isoforms are restored in the TM mice. b, Representative western blots source data. 


of synaptic proteins in the adult cerebellum across genotypes. 
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Extended Data Figure 7 | Whole-cell measurements of excitatory WT and KO mice (n= 13 WT and n= 10 KO cells; Student’s two-tailed 
synaptic function in the cortex. a, b, Summary bar graphs of mEPSC unpaired t-test). Data are mean +s.e.m. c, Representative traces from 
frequency (a) and amplitude (b) in the prefrontal cortex of the adult mEPSC recordings in the adult prefrontal cortex of the WT and KO mice. 
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Extended Data Figure 8 | Electrophysiological measurements in 

the dorsal striatum of germline rescue mice. a, b, Normal pop spike 
amplitude (a) and NP1 (b) in germline rescued mice (insets show 
representative field traces). c, d, Reduced mEPSC frequency in KO 
compared to WT and GR mice (c). A reduction in mEPSC peak current 
amplitude is observed between KO and WT group only (d) (n= 30 WT, 
n=24 KO and n= 26 GR MSNs). e, Representative traces (left) and bar 
graph (right) for the pharmacologically isolated AMPAR/NMDAR ratio 


(n=9 WT, n=9 KO and n=8 GR MSNs). Dual-component evoked 

EPSC at +40 mV recorded before and after pL-AP5 bath application. 

f, Representative traces (left) and bar graph (right) for NR2B/NMDAR 
ratio in WT, KO and GR groups (n= 6 WT, n=5 KO and n=7 GR MSNs). 
Dual-component evoked EPSC at +40 mV recorded before and after 
ifenprodil bath application. *P < 0.05; **P < 0.01; ***P < 0.001 

(two-way (a, b) and one-way (c-f) ANOVA with Bonferroni post-hoc test). 
Data are mean +s.e.m. 
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Extended Data Figure 9 | Expression of SHANK3 at P20-P21 rescues 
some behavioural measurements. a, Representative western blots 
showing efficient SHANK3 re-expression in the cortex, striatum and 
cerebellum in mice that were treated with tamoxifen at P20-P21. 

b, The total distance travelled as measured by the open-field test was not 
improved in the TM condition compared to the KO condition. One-way 
ANOVA with Bonferroni post-hoc test. c, The open-field total distance 
plotted across 5-min time bins, showing that there is no difference 
between KO and TM conditions across time. d, Rearing activity measured 
by open-field plotted across time, showing that the TM mice perform 
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in between the WT and KO mice for most of the 30-min test. e, Rearing 
time measured by open-field test plotted across time, also showing the 
intermediate performance of TM mice between that of WT and KO mice 
(b-e: n= 21 WT, n=36 KO and n= 30 TM mice; two-way ANOVA with 
Bonferroni post-hoc test). f, Activity on the zero maze indicates that the 
TM condition shows significantly reduced anxiety compared to that of 
the KO condition (n= 18 WT, n=25 KO and n= 30 TM mice; outliers 
were removed using Iglewicz and Hoaglin’s test (two-sided); Kruskal- 
Wallis test, Dunn’s multiple comparison test). Data are mean + s.e.m. See 
Supplementary Fig. 1 for gel source data. 
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Extended Data Figure 10 | Expression of SHANK3 at P20-P21 improves 
motor coordination. a, Summary data from trial 1 of the accelerating 
rotarod test on mice treated with TM at P20-P21 (n= 19 WT, n=35 KO 
and n= 33 TM mice). b, Summary data from trial 2 of the same rotarod 
test on mice treated at P20-P21 (n=18 WT, n= 33 KO andn=33 TM 
mice). ¢, Summary data from trial 3 of the rotarod on mice treated at 
P20-P21 (n= 20 WT, n= 36 KO and n= 33 TM mice). Outliers in 
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a-c were removed using Iglewicz and Hoaglin’s robust outlier test. 
One-way ANOVA with Bonferroni post-hoc test. d, Body weight from WT 
and KO mice that were unfed (UF) and mice that were fed with either corn 
oil or TM (n= 28 WT (UBF), n=14 WT (TM), n=27 KO (UB), n=25 KO 
(CO) and n= 26 KO (TM)). One-way ANOVA with Bonferroni post-hoc 
test. Data are mean +s.e.m. 
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Inhibiting fungal multidrug resistance by disrupting 
an activator— Mediator interaction 


Joy L. Nishikawa’”, Andras Boeszoermenyi’, Luis A. Vale-Silva*, Riccardo Torelli°, Brunella Posteraro®, 
Yoo-Jin Sohn!”, Fei Ji7®, Vladimir Gelev’, Dominique Sanglard‘, Maurizio Sanguinetti>, Ruslan I. Sadreyev’!, 
Goutam Mukherjee!?, Jayaram Bhyravabhotla!+*!8, Sara J. Buhrlage*4, Nathanael S. Gray*“, Gerhard Wagner’, 


Anders M. Nadir)? & Haribabu Arthanari® 


Eukaryotic transcription activators stimulate the expression of 
specific sets of target genes through recruitment of co-activators 
such as the RNA polymerase II-interacting Mediator complex’. 
Aberrant function of transcription activators has been implicated 
in several diseases. However, therapeutic targeting efforts have 
been hampered by a lack of detailed molecular knowledge of the 
mechanisms of gene activation by disease-associated transcription 
activators. We previously identified an activator-targeted three- 
helix bundle KIX domain in the human MED 15 Mediator subunit 
that is structurally conserved in Gall 1/Med15 Mediator subunits 
in fungi**. The Gall 1/Med15 KIX domain engages pleiotropic 
drug resistance transcription factor (Pdr1) orthologues, which 
are key regulators of the multidrug resistance pathway in 
Saccharomyces cerevisiae and in the clinically important human 
pathogen Candida glabrata*®. The prevalence of C. glabrata is 
rising, partly owing to its low intrinsic susceptibility to azoles, 
the most widely used antifungal agent”*. Drug-resistant clinical 
isolates of C. glabrata most commonly contain point mutations 
in Pdr1 that render it constitutively active” '*, suggesting that 
this transcriptional activation pathway represents a linchpin in 
C. glabrata multidrug resistance. Here we perform sequential 
biochemical and in vivo high-throughput screens to identify 
small-molecule inhibitors of the interaction of the C. glabrata Pdr1 
activation domain with the C. glabrata Gall11A KIX domain. The 
lead compound (iKIX1) inhibits Pdr1-dependent gene activation 
and re-sensitizes drug-resistant C. glabrata to azole antifungals 
in vitro and in animal models for disseminated and urinary tract 
C. glabrata infection. Determining the NMR structure of the 
C. glabrata Gal11A KIX domain provides a detailed understanding 
of the molecular mechanism of Pdr1 gene activation and multidrug 
resistance inhibition by ikKIX1. We have demonstrated the 
feasibility of small-molecule targeting of a transcription factor- 
binding site in Mediator as a novel therapeutic strategy in fungal 
infectious disease. 

On the basis of our previous findings that deletion of the KIX 
domain of S. cerevisiae GAL11 or C. glabrata GAL11A abrogates Pdr1- 
dependent transcriptional responses and xenobiotic tolerance we 
hypothesized that the C. glabrata Pdr1-C. glabrata Gall1A (CgPdr1- 
CgGal11A) interaction interface might serve as a promising target for 
novel anti-multidrug resistance compounds’. A fluorescently tagged 
CgPdr1 activation domain (AD) was used in an in vitro fluorescence 
polarization screen!* of ~140,000 chemically diverse compounds 


to identify small molecules that block the interaction between the 
CgGall1A KIX domain and the CgPdr1 AD (Extended Data Fig. 1). 
On the basis of the high degree of conservation between S. cerevisiae and 
C. glabrata, we followed up the top hits from the fluorescence polari- 
zation screen with an azole growth inhibition screen in S. cerevisiae to 
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Figure 1 | Discovery of inhibitors of the CgGall11A KIX-CgPdr1AD 
interaction. a, Schematic of the screening process. b, iKIX1 inhibits cell 
growth in a concentration-dependent manner in the presence of 5M 
ketoconazole (KET); error bars represent means +s.d. from duplicate 
plates. c, iKIX1 structure. d, Fluorescence polarization titration curve 
showing the interaction of CgGall1A KIX domain with CgPdr1 AD30 
fitted to a dissociation constant (Kg) value of 319.7nM+9.5 nM (left). 
iKIX1 competes out CgPdr1 AD30 with an half-maximum inhibitory 
concentration (ICs) value of 190.2 1M + 4.1 |.M (right). The measured 
K, and ICs9 values were used to calculate an apparent inhibition constant 
(Kj) value of 18.1 .M for iKIX1. Data represent mean of two replicates and 
standard error from the fit is shown. 
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Figure 2 | Elucidation of the CgGal11A KIX domain structure; 
CgPdr1 AD and iKIX1 bind to a similar interface on the CgGall11A 
KIX domain. a, Backbone representation of the 10 lowest energy NUR 
structures of CgGall11A KIX domain; backbone r.m.s.d. ~ 0.7 A, left. 
Overlay of CgGal11A (purple) and S. cerevisiae Gall1/Med15 (blue) KIX 
domains, with an overall r.m.s.d. of 2.0 A (ref. 25), right. b, Chemical 
shift perturbations (CSPs) on the CgGall11A KIX domain in the presence 
of CgPdr1 AD (red) or iKIX1 (blue). Residues coloured in red or blue 
indicate a chemical shift perturbation greater than 2 s.d. Residues 


identify hits with in vivo efficacy (Fig. 1a). We identified 5 compounds 
that reproducibly inhibited growth in a concentration-dependent 
manner only in the presence of ketoconazole (Extended Data Fig. 1). 
The most potent compound is referred to as iKIX1 (Fig. 1b, c). 
In vitro binding studies revealed that the Ky of the CgPdr1 AD for the 
CgGal11A KIX domain is 0.32 1M and the apparent Kj for iKIX1 is 
18M (Fig. 1d). 

To facilitate the elucidation of the mechanism of action of iKIX1, we 
determined the high-resolution solution structure of the CgGall1A 
KIX domain with a backbone root mean square deviation (r.m.s.d.) of 
0.7 A (Fig. 2 and Extended Data Table 1; PDB ID 4D7X). The CgGall1A 
KIX domain has 51% sequence identity and 61% similarity with the 
S. cerevisiae Gall1/Med15 KIX domain} with an overall r.m.s.d. of 2.0A 
(Fig. 2a and Extended Data Fig. 2c). The CgGal11A KIX domain forms 
a three-helix bundle harbouring an extensively hydrophobic core and 
a short helix at the N terminus (Fig. 2a). We determined interaction 
interfaces of the CgGall1A KIX domain with the CgPdr1 AD and iKIX1 
(Fig. 2b) by chemical shift perturbation (CSP) analysis. The CgPdr1 AD 
and iKIX1 target the same large hydrophobic groove harboured by the 
three helices. Residues from all three helices constitute the interaction 
interface, and titration of an isoleucine, leucine, valine (ILV)-methyl 
labelled CgGall1A KIX domain reveals large CSPs on the three leucines 
(L19, L23 and L51) upon addition of CgPdr1 AD and iKIX1 (Extended 
Data Fig. 2b). The basic interaction interface on the KIX domain 
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highlighted in green (L19, L23 and L51) represent important CSPs in the 
side-chain methyl groups of an ILV labelled sample. c, The ikIX1 and 
CgPdr1 AD target the hydrophobic groove on the CgGall1A KIX domain, 
which is surrounded by a basic patch. Residues H43, K54, K68, K78, 

R79, F47 and M72 present a positive electrostatic surface enclosing the 
binding interface. d, iKIX1 docked to the CgGall11A KIX domain. iKIX1 
is depicted as red sticks and spheres. Residues near the binding site of 
iKIX1 are depicted as blue sticks. The methyl resonances of L19 and L23 
experience considerable CSP upon addition of iKIX1. 


complements the acidic residues of CgPdr1 AD (Fig. 2c). Residues of 
the CgGal11A KIX domain that interact with CgPdr1 AD and iKIX1 
overlap strongly, suggesting direct competitive binding as the mech- 
anism of inhibition. Docking of ikKIX1 to the CgGall1A KIX domain 
suggests extensive hydrogen bonding and hydrophobic interactions 
between iKIX1 and KIX domain residues (Fig. 2d and Extended Data 
Fig. 2a), matching the interaction interface mapped by CSP analysis. 

To assess the in vivo effects of iKIX1 on Pdr1-dependent transcrip- 
tion, we initially used a strain in which the two S. cerevisiae PDR1 
orthologues (ScPDR1 and ScPDR3) are deleted and which carries a 
plasmid expressing CgPDR13, and a heterologous luciferase gene driven 
by 3 pleiotropic drug response elements (PDREs). Luciferase activity 
was strongly induced by ketoconazole treatment; ikIX1 co-treatment 
was able to block this induction in a concentration-dependent manner 
(Fig. 3a). 

A chromatin immunoprecipitation (ChIP) assay was used to 
examine Gall1/Med15 recruitment to Pdrl-regulated target genes in 
S. cerevisiae after iKIX1 treatment. Gall 1/Med15 was rapidly recruited 
to the promoters of the Pdr1 target genes PDR5 and SNQ2 after keto- 
conazole addition; in contrast, ketoconazole-induced recruitment of 
Gall 1/Med15 was abrogated when the cells were pre-treated with ikIX1 
(Fig. 3b, Extended Data Fig. 3a). iKIX1 did not impede the constitutive 
occupancy of Pdr1 at the same Pdr1-regulated target genes (Extended 
Data Fig. 3b). Consistent with the ChIP data, ikIX1 strongly inhibited 
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Figure 3 | iKIX1 blocks Gall11/Med15 recruitment and upregulation of 
Pdr1 target genes. a, iKIX1 inhibits ketoconazole-induced upregulation 
of luciferase activity in a dose-responsive manner ina S. cerevisiae 

par1 Apdr3A strain containing plasmid-borne CgPDR1 and 3XPDRE- 
luciferase. UT, untreated control; **P < 0.001. b, c, iKIX1 prevents the 
ketoconazole-induced recruitment of Gall1/Med15/Mediator to the 
upstream activating sequences (UAS) of the PDRE-regulated promoter 
ScPDR3S (b), and ScPDR35 induction (c). Representative experiment from 
two biological replicates (ChIP DNA and RNA from same experiment) is 
shown. Error bars indicate s.d. of technical replicates; ***P < 0.00001, 
**P < 0.0005 by two-tailed Student's t-test. d, iKIX1 inhibits 
ketoconazole-induced transcriptional upregulation of CgCDR1 and 


azole-induced transcription of ScPdr1 target genes (Fig. 3c, Extended 
Data Fig. 3a, c). 

Next, we determined the effect of iKIX1 on the transcription of 
C. glabrata Pdr1-regulated genes involved in drug efflux and multi- 
drug resistance (CgCDRI, CgCDR2 and CgYOR1). CgPdr1 targets 
were strongly upregulated after ketoconazole treatment'®'”. However, 
pre-treatment with ikKIX1 reduced target gene induction in a durable 
and concentration-dependent manner (Fig. 3d and Extended Data 
Fig. 4a, b). Treatment with iKIX1 alone did not significantly affect Pdr1- 
target gene induction (Extended Data Fig. 4c, d). 

Next-generation RNA sequencing (RNA-seq) was employed to 
query the genome-wide effects of iKIX1 and azole treatments alone 
and in combination on the transcriptome in both S. cerevisiae and 
in C. glabrata. In accord with previous reports!°!®, azole treatment 
upregulates Pdr1-dependent genes in both yeasts, such as the drug 
efflux pumps ScPDR5 and CgCDRI1 (Supplementary Tables 1 and 2). 
Combined azole and ikIX1 treatment strongly blunted expression of 
many azole-activated and Pdr1-dependent genes in both S. cerevisiae 
and C. glabrata (Fig. 3e, Extended Data Fig. 3d and Supplementary 
Tables 1 and 2), consistent with prior data and the proposed mech- 
anism of action of iKIX1. iKIX1 alone affected very different sets of 
genes in S. cerevisiae and C. glabrata (Supplementary Tables 1-3). 
Treatment of S. cerevisiae and C. glabrata cells with ikKIX1 did not 
significantly alter the expression of PDR1 or GAL11/MED15 after 
azole treatment (Extended Data Fig. 3e). Together, these findings 
suggest that the primary mechanism of synergistic antifungal effects 
of iKIX1 with azoles is through blocking the azole-stimulated and 
Pdr1-dependent drug efflux pathway. 

To ascertain iKIX1 efficacy in azole-resistant C. glabrata strains, we 
examined the effects of iKIX1 on CgPdr1 target gene expression in a 


CgCDR2 in a CgPDR1 wild-type strain (SFY114). **P < 0.005. e, RNA- 
seq analysis of a C. glabrata SFY114 (PDRI wild-type) strain pre-treated 
with iKIX1 or vehicle alone then induced with ketoconazole (iKIX1 + 
KET and KET, respectively). f, iKIX1 inhibits xenobiotic-induced CgPdr1 
transcription in CgPdr1 gain-of-function mutants (amino acid changes 
indicated). Samples shown were induced with ketoconazole. *P < 0.05 and 
**P < 0.01 as compared to dimethylsulfoxide (DMSO) + ketoconazole 
control. g, iKIX1 inhibits rhodamine 6 G efflux in C. glabrata as compared 
to vehicle control. *P < 0.05, **P < 0.005 as compared to DMSO + 
ketoconazole control. a, d, e-g, Data represent the means of three 
biological replicates. Two-tailed Student's t-test used to determine 

P values; error bars represent means +s.d. 


set of isogenic strains with gain-of-function CgPDR1 mutations orig- 
inally identified in azole-resistant C. glabrata clinical isolates”. iKIX1 
reduced azole-induced transcription of CgPdr1 target genes (for exam- 
ple, CgCDR1) in a concentration-dependent manner in all strains tested 
(Fig. 3f and Extended Data Fig. 4d). 

To investigate whether these transcriptional effects translated to 
functional effects on drug efflux rates, we used the fluorescent com- 
pound rhodamine 6G, a substrate of the CgCdr1 efflux pump!”"®. 
Maximum efflux rates were significantly decreased in PDR1 wild-type 
or gain-of-function strains pre-treated with iKIX1, as compared to 
vehicle control (Fig. 3g and Extended Data Fig. 5). 

Owing to its ability to reduce efflux pump gene expression and 
pump activity, we predicted that ikIX1 could restore azole-sensitivity 
to CgPDR1 gain-of-function mutant strains. Isogenic C. glabrata strains 
with wild-type or single gain-of-function alterations across CgPdr1 
(Fig. 4a) were tested for their sensitivity to fluconazole or ketocona- 
zole on gradient plates with increasing concentrations of iKIX1 or 
vehicle. As expected, a CgPDRI wild-type strain was sensitive to both 
fluconazole and ketoconazole, whereas CgPDR1 gain-of-function 
mutant strains grew robustly in the presence of azoles. iKIX1 restored 
azole-sensitivity to PDR1 gain-of-function mutant strains in a concen- 
tration-dependent manner (Fig. 4b). CgPDRI wild-type strains also 
exhibited increased growth inhibition in the presence of both ikIX1 
and azole versus single agents alone (Extended Data Fig. 6a, b). 

On the basis of the strong combination effect of azoles and iKIX1 in 
the CgPDR1!*®°F mutant we focused follow-up studies on this mutant 
strain. To investigate whether azoles and iKIX1 act in a synergistic 
or additive manner in CgPDRI1 wild-type and CgPDR1/75* mutant 
strains, we assessed growth in checkerboard assays with ketocona- 
zole and iKIX1. In the wild-type CgPDR1 strain, the combination 
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Figure 4| iKIX1 as a co-therapeutic in models of C. glabrata 
disseminated disease. a, Schematic showing CgPdr1 gain-of-function 
alterations in relation to putative functional domains. DBD, DNA-binding 
domain; ID, inhibitory domain; MHR, middle homology region; AD, 
activation domain. b, iKIX1 restores the efficacy of azoles towards CgPDR1 
gain-of-function mutants. Plates contained increasing concentrations of 
vehicle control (DMSO) or iKIX1 to 150 ,.M in the absence or presence 

of fluconazole (FLU) or ketoconazole (KET). ¢, iKIX1 in combination 
with fluconazole but not fluconazole alone significantly extended survival 
of G. mellonella larvae injected with CgPDR1!78" (SFY115, n=9). For 
SFY114, n= 10. *P < 0.05, ***P < 0.001 as compared to PBS vehicle 
control. Statistical differences measured using a log-rank (Mantel-Cox) 
test. d, iKIX1 combination treatment with 25 mg kg~! fluconazole (low 
FLU) reduces fungal tissue burden in the kidney or spleen of mice 
injected with CgPDRI1 wild-type (SFY114); iKIX1 in combination with 
100 mg kg! fluconazole (high FLU) reduces fungal tissue burden in the 
kidney or spleen of mice injected with CgPDR1'8" (SFY115). n=5 mice 
for each treatment condition; *P < 0.05, **P < 0.005 and ***P < 0.0001 
as compared to no treatment. Statistical differences measured using a 
Wilcoxon rank-sum test; error bars represent means +s.d. 


of ketoconazole and iKIX1 was additive (Extended Data Fig. 6c). 
However, the CgPDR1/?°F mutant exhibited synergistic growth inhi- 
bition with iKIX1 and ketoconazole combination treatment, with com- 
bination indices <1 (Extended Data Fig. 6d), in concordance with the 
spot-plating assay. 

We carried out a limited analysis exploring the chemical space 
around the iKIX1 scaffold using commercial and custom synthesized 
iKIX1 analogues, identifying several compounds that lost activity in 
all assays; one analogue (A2) is shown in Extended Data Fig. 7a-d. 
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This example, together with data from iKIX1 analogues and the docked 
structure of iKIX1 to the CgGall11A KIX domain, supports a model 
where iKIX1 engages the core of the KIX domain using an array of 
hydrophobic and hydrogen-bond contacts. 

We used two metazoan model systems to evaluate the potential utility 
of iKIX1 as a co-therapeutic with fluconazole to treat disseminated 
C. glabrata infection. The larvae of the moth Galleria mellonella has 
been used as a model to test the pathogenicity of a wide variety of 
human pathogens’. We used a G. mellonella survival assay to determine 
the virulence of C. glabrata PDR1 wild-type or PDR1‘”5™ strains in the 
presence of fluconazole, iKIX1, or a combination of the two (Fig. 4c). 
Larvae were injected with C. glabrata and a single injection of flucona- 
zole (50mgkg~'), iKIX1 (25 mgkg7'), a combination of the two, or 
vehicle; survival was monitored every 24h. G. mellonella injected with 
wild-type CgPDR1 was sensitive to fluconazole alone, and exhibited 
no significant alterations in survival with a fluconazole-iKIX1 combi- 
nation. However, in G. mellonella larvae injected with a CePDR 1'78°" 
strain, whereas the single agents fluconazole or iKIX1 did not signifi- 
cantly increase survival compared to vehicle, co-treatment with iKIX1 
and fluconazole significantly increased survival (P < 0.001). 

Prior to mammalian studies, we sought to evaluate the poten- 
tial toxicity of iKIX1 in mammalian cells (Extended Data Fig. 7e, f). 
Human HepG2? cells treated with ikKIX1 revealed toxicity only at high 
concentrations of iKIX1 (ICso  100|1M). iKIX1 had no effect on the 
transcription of SREBP-target genes at concentrations up to 100|.M, 
indicating its specificity for the fungal Gall 1/Med15 KIX domain*. We 
also assessed the in vitro stability and in vivo mouse pharmacokinetics 
of iKIX1 and found that iKIX1 exhibited favourable drug-like proper- 
ties and in vivo exposure in these studies (Extended Data Fig. 8 g, h). 

To evaluate the therapeutic potential of ikKIX1 and azole antifungal 
co-therapy ina mammalian model, we initially turned to an established 
mouse model of disseminated fungal disease'!. Mice were inoculated 
with C. glabrata by tail-vein injection and were dosed peritoneally once- 
daily with 100 mgkg~ fluconazole (high FLU), 100 mgkg~! iKIX1, a 
combination of the two, or vehicle alone. After 7 days, mice injected 
with a CgPDR1 wild-type strain exhibited significantly reduced tissue 
fungal burden in the kidney and spleen following fluconazole treatment 
alone; iKIX1 co-treatment did not result in further reductions (Fig. 4d). 
In contrast, in mice injected with the azole-resistant CgPDR 1/78" 
strain, only co-treatment with iKIX1 and fluconazole resulted in sig- 
nificant (~tenfold) reductions in fungal burdens in the kidney and 
spleen (P < 0.0001) (Fig. 4d). Similar results were observed with the 
clinically isolated CePDR1* and CgPDR1!®F strains DSY562 and 
DSY565 (Extended Data Fig. 8a). Consistent with previous studies?, the 
fungal burden in mice infected with the CgPDR1!*®” strain was higher 
than those infected with wild-type CgPDR1 strains, suggesting that 
PDRI mutant strains may be more virulent in vivo. Similar but less pro- 
nounced results were found in mice injected with a CePDR1?®" strain 
(Extended Data Fig. 8b). When mice were injected with a CgPDR1 
wild-type strain and dosed with 25 mg kg~' fluconazole (low FLU) 
alone or in combination with iKIX1, fluconazole alone poorly reduced 
tissue burden, whereas combination treatment resulted in significant 
(~tenfold) reductions in fungal burdens in both organs (P < 0.0001) 
(Fig. 4d). These results suggest that ikKIX1 combination treatment with 
azole may be therapeutically desirable even in the absence of CgPDR1 
gain-of-function mutations. Mice infected with a Cgpdr1 null strain 
were more sensitive to iKIX1 alone; unlike mice infected with CgPDR1* 
or CgPDR1‘8" strains, low doses of iKIX1 did not further reduce 
fungal burden in Cgpdr1 null infections (Extended Data Fig. 8c, d). 

CgPDR1 gain-of-function mutations are also known to control 
adherence to host cells”°. As previously observed”, a PDR1!7F mutant 
increased relative adherence to epithelial cells as compared to a PDR1 
wild-type strain. Strikingly, ikIX1 treatment alone reduced adherence 
to levels similar to a PDR1 wild-type strain (Extended Data Fig. 8e). 
Ketoconazole alone or co-treatment with iKIX1 also reduced relative 
adherence to levels comparable to a PDR1 wild-type strain. To assess 
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the role of ikKIX1 in modulating adhesion in an infection model, we 
turned to a mouse model of urinary tract infection”! In both the blad- 
der and kidney, iKIX1 alone was sufficient to decrease fungal load after 
infection with either a PDRI wild-type strain or a PDR1/®”" strain 
(Extended Data Fig. 8f), suggesting that iKIX1 may indeed modulate 
adhesion. 

The proportion of azole-resistant C. glabrata (up to 20% in the US) 
and the emergence of multidrug resistance (approximately 40% of 
echinocandin-resistant isolates are azole-resistant) shows the need for 
novel treatments that can target these resistant populations”””’. Our 
results demonstrate that small-molecule disruption of the interaction 
between the CgGall 1A KIX domain and the CgPdr1 activation domain 
is a therapeutically tractable method for resensitizing azole-resistant 
C. glabrata to standard azole antifungal treatment (Extended Data Fig. 9). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Library name Number of 
molecules 
Biomol ICCB 
Known 
Bioactives3 
Ninds Custom 1,040 
Collection 2 
Prestwick 1 1,120 
Collection 
ChemBridge 3 10,560 
ChemDiv 4 14,677 
Life Chemicals 1 3,893 
Maybridge 5 3,212 
ChemDiv 3 16,544 
Maybridge 1 8,800 
ChemDiv 6 44,000 


A 600 


6 
eee] oa 


Total number of 143,280 
molecules 


Extended Data Figure 1 | List of compound libraries screened and top 
hits from the secondary in vivo viability screen. Left, table of compound 
libraries that were screened using a fluorescence polarization assay at the 
Institute of Chemistry & Cell Biology (ICCB) facility at Harvard Medical 
School. Right, an S. cerevisiae viability screen identifies small molecules 
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dependent manner in the presence of 541M ketoconazole. Top hits from 
the screen are shown; A600 nm values are the average of values from 
duplicate plates. Error bars denote mean + s.d. 
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Extended Data Figure 3 | iKIX1 blocks Gall 1/Med15 recruitment and 
upregulation of Pdr1 target genes. a, ikKIX1 prevents the ketoconazole- 
induced recruitment of ScGall1/Med15/Mediator to the upstream 
activating sequences (UAS) of the PDRE-regulated promoter ScSNQ2 and 
transcriptional upregulation of ScSNQ2. b, Haemagglutinin (HA)-tagged 
Pdr1 occupies PDRE-regulated promoters of ScPDR5 and ScSNQ2 in the 
presence of 20 11M iKIX1 or vehicle (DMSO) control before and following 
ketoconazole addition. c, 20 1M iKIX1 inhibits ketoconazole-induced 
upregulation of ScPdr1 target genes ScPDR5 and ScSNQ2 in the HA-Pdr1 
strain. RNA was harvested concurrently with representative chromatin 
immunoprecipitation experiment shown in b at t=0 min (DMSO, 

20 1M iKIX1) and t= 15 min after ketoconazole induction (DMSO + 
KET, 201M iKIX1 + KET). Transcripts are normalized to ScSCR1 and 
un-induced DMSO control. a-c, Representative experiment from two 


™PDR1 ™GAL11A 


biological replicates is shown. Error bars represent mean +s.d. of technical 
replicates; *P < 0.05, **P < 0.01 and ***P < 0.001 as calculated by two- 
tailed Student's t-test. d, RNA-seq analysis of a wild-type S. cerevisiae 
strain (BY4741) pre-treated with iKIX1 or vehicle alone then induced with 
ketoconazole (iKIX1 + KET and KET, respectively) demonstrate a blunted 
induction of Pdr1 target genes following iKIX1 pre-treatment. Data 
represent means of three biological replicates. e, iKIX1 pre-treatment does 
not significantly alter the transcript levels of PDR1 or GAL11/GAL11A in 
S. cerevisiae or C. glabrata after azole induction. Cells were pre-incubated 
with vehicle (DMSO) or iKIX1 and then induced with 40 11M ketoconazole 
(+KET) for 15 min before harvest. Average value of three biological 
replicates is shown and error bars represent mean +s.d.; *P < 0.05, 

**P < 0.001 as compared to DMSO or DMSO+KET control, calculated by 
two-tailed Student's t-test. 
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Extended Data Figure 4 | iKIX1 pre-treatment durably represses 
azole-induced CgPdr1-dependent transcription but not basal levels 

of CgPdr1-dependent transcripts. a, b, With ikKIX1 pre-treatment, 
CgPdr1-dependent transcription of (a) CgCDR1 and (b) CgYOR1 remains 
repressed 120 min after ketoconazole induction. SFY114 (PDR1 wild- 
type) cells were pre-incubated with vehicle (DMSO) or iKIX1 and then 
induced with 40 |1M ketoconazole (+-KET). Transcript levels were assessed 
by quantitative RT-PCR before and for 120 min following ketoconazole 


induction. Transcript levels are normalized to CgRDN25-1 and 
un-induced vehicle control (DMSO) at t= 0. c, iKIX1 treatment alone 
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does not have significant effects on CgPdr1 target gene induction either in 
the presence of wild-type (SFY114) or gain-of-function mutant CgPDR1 
(amino acid alterations indicated). d, Table of average CgCDR1 delta Cp 
values (Cpcgcpri — CPcgrpnzs-1) and corresponding standard deviation for 
quantitative real-time PCR experiments shown in Fig. 3 f and Extended 
Data Fig. 4c. a-d, For all panels of Extended Data Fig. 4, average value 

of three biological replicates is shown and error bars represent +s.d.; 

*P < 0.05, **P < 0.01, and ***P < 0.005 as compared to vehicle 

+ ketoconazole control. P values calculated using two-tailed Student's t-test. 
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Extended Data Figure 5 | iKIX1 inhibits efflux of rhodamine 6G in 
PDR1 wild-type, PDR1’*° and PDR1*** strains. Data points indicate 
mean of three biological replicates and error bars represent mean +s.d. 
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bearing wild-type CgPDR1 to azole treatment and synergistically synergistically inhibit the growth of the CgPDR1!78"" mutant. c, d, The 
enhances azole sensitivity ina CgPDR1'”°F mutant strain. a, b, Two EUCAST broth microdilution method”? was used to assess the effects 
strains bearing wild-type CgPDR1 alleles (SFY114, DSY759) were plated of iKIX1 and ketoconazole combination treatment. Growth, as assessed 
at concentrations differing by tenfold (10x, 1x) on plates containing by As40 nm, Was normalized to no drug control. All combination indices 
increasing concentrations of iKIX1 to 300.M in the presence or absence (CI) for the CgPDR1!78F mutant were less than 1, indicating synergy. 
of 1 1M ketoconazole (a) or ikKIX1 to 250 |1M in the presence or absence A representative of three biological replicates is shown and the red line 
of 50 1M fluconazole (b). c, iKIX1 and ketoconazole have additive effects indicates a combination index of 1. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


a b C 
90 > 
3005 se 8 75 
N 
C L NON Jl 4 
250 + a: 60 
200 


150°) \C50= 1000.1 pM 


Fluorescence Anisotropy (a.u.) 


luciferase activity normalized to growth 
and untreated control 
fold change CgCDR1 normalized to 
RDN25-1 
o 
oOo 


1005 
50-4 15 
ae '0 0 10 10 10 e 
10° 10° 10! 2 3 4 
[A2] uM SOS ES 
\ SP 
O 0% an* Y 
OF oh 
RN 
Re: 
3 
Le) 


PDR 11280F 
PDR1R376w 
PDR17588A 
PDR 121082G 
DMSO 
e f Q mouse plasma microsomal stability 
stability (1 mg/mL + NADPH) 
HepG2 cell viability 3 100 100 
B 150 F 2 80 2 80 
8 100 S = 60 = 60 
g o 5 40 
2 50 > © 40 g 
s £ & 20 & 20 
* [s) 
0 = 
1 10 100 5 @2PasBRS . ? 
iKIX1] uM - 2 = 0 20 40 60 0 20 40 60 
S$ [ikixt] pM minutes ‘cinuias 
Oo 


—®—human —= mouse 
=SCD1 “FASN = HMGCR 


h ’ 
piv | 2 | - | 1267451] 1502.00 | 1521.92 2190 | 223 | - | 

Extended Data Figure 7 | iKIX1 exhibits favourable drug-like t-test. e, iKIX1 inhibits viability of HepG2 cells at concentrations >50 \1M. 
properties and an inactive analogue highlights the importance of The mean of 3 biological replicates is shown; error bars represent means 
the electron withdrawing chloride groups on the aromatic ring of +s.d. f, iKIX1 exhibits no effect on transcription of SREBP-target genes 
iKIX1. a-d, A structurally similar ikKIX1 analogue (A2) lacking electron- in HepG2 cells at concentrations up to 100|.M. Biological duplicates were 
withdrawing groups increases the ICso in the fluorescence polarization assessed; representative experiment is shown and error bars represent 
assay (a) and abolishes activity in the S. cerevisiae luciferase reporter assay means +s.d. of technical (real-time PCR) replicates. g, Mouse plasma 
(b), repression of CgCDRI1 expression (c), and synergistic C. glabrata cell stability of iKIX1 and mouse and human microsomal stability of iKIX1, 
growth inhibition with azoles (d). Error bars in b, c indicate mean +s.d.of |= n=1.h, In vivo pharmacokinetic parameters of iKIX1, n =3 mice per 
technical replicates (reads/real-time PCR reactions, respectively). time point. 


**P < 0,005; statistical differences calculated using two-tailed Student's 
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Extended Data Figure 8 | iKIX1-fluconazole combination treatment 
reduces fungal tissue burden by CgPDR1* or CgPDR1 gain-of-function 
mutant strains in disseminated disease models; iKIX1 alone reduces 
adherence and fungal burden in a UTI model. a, Clinical isolates 
DSY562/DSY565 (azole sensitive and PDR1!78"" azole-resistant strains, 
respectively) behave similarly to SFY114/SFY115 (isogenic PDR1* and 
PDR175F strains, shown in Fig. 4d) in the mouse infection model. n = 10 
mice for each treatment condition; *P < 0.01, **P < 0.005 and 

#3 D < (),0001. b, iKIX1 combination treatment with fluconazole reduces 
fungal tissue burdens in the spleen or kidney of mice injected with 

C. glabrata PDR1?**" (SFY116). n=5 mice for each treatment condition; 
**P < 0,01 and *P < 0.05. c, 100mgkg~' day! iKIX1 (high ikKIX1) 
treatment of mice infected with SFY93 (pdr1A) significantly reduces 
fungal burden in a mouse infection model (colony forming units (CFU) 
per g of kidney) alone as compared to SFY114 (PDR1*) or SFY115 
(PDR1!7®°F), n= 10 mice for each treatment condition; *P < 0.01, 
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**P < 0.005, ***P < 0.0001. d, Mice infected with SFY114 (PDRI1*), 
SFY115 (PDR1/8°F) or SFY93 (pdr1A) were treated with low (10 mgkg~! 
day~') iKIX1, low fluconazole (low FLU; 25mg kg~! day”), fluconazole 

at 100 mgkg~! day’ or combination with the two. iKIX1 did not confer 
additional reductions in CFU per g of kidney with SFY93 infection. 

n= 10 mice for each treatment condition. ***P < 0.0005. e, iKIX1 and 
ketoconazole reduce adherence of CgPDR1!*8°F (SFY115) to CHO-Lec2 
cells. Adherence is normalized to SFY114 DMSO control; each column 
represents the average of 4 biological replicates. *P < 0.05 as compared 

to SFY114 DMSO control. f, iKIX1 (100 mgkg~! day~') or fluconazole 
significantly reduces fungal burden in the bladder and kidney in a urinary 
tract infection model in mice. n= 15 mice were infected in each group and 
points at 0 logio CFU per g of organ fell below the detection limit of the 
method (50 CFU per g of organ). *P < 0.05, **P < 0.005. a-f, Statistical 
differences were measured using a Mann-Whitney/ Wilcoxon rank-sum 
test as compared to no treatment control; error bars represent means +s.d. 
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Extended Data Figure 9 | Model of iKIX1 function. iKIX1 functions as a co-therapeutic in combination with an azole, blocking the azole-induced 
recruitment of Gall1/Med15-Mediator to Pdr1 target genes upon azole-treatment and preventing the upregulation of Pdr1 target genes, including those 
which encode drug efflux pumps. This figure has been adapted from ref. 3. 
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Extended Data Table 1 | Summary of quality statistics for the ensemble of 10 structures calculated with AMBER explicit water refinement 
and list of experimental restraints 


Protein 
NMR distance and dihedral constraints 
Distance constraints 
Total NOE 1718 
Intra-residue 602 
Inter-residue 1116 
Sequential (|i-j] = 1) 517 
Medium-range (|i-j| < 4) 488 
Long-range (|i-j| > 5) 111 
Hydrogen bonds 0 
Total dihedral angle restraints 158 
phi 79 
psi 79 
Structure statistics 
Violations (mean and s.d.) 
Distance constraints (A) 0.083 + 0.039 
Dihedral angle constraints (°) 2.407 + 0.357 
Max. dihedral angle violation (°) 10.267 
Max. distance constraint violation (A) 0.248 
Deviations from idealized geometry 
Bond lengths (A) 0.013 
Bond angles (°) 1.9 
Average pairwise r.m.s.d.** (A) 
Heavy 1.1 
Backbone 0.7 


**Pairwise r.m.s.d. was calculated among ordered residues (3-83) of 10 refined structures. 
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Tumour-specific proline vulnerability uncovered by 
differential ribosome codon reading 


Fabricio Loayza-Puch'*, Koos Rooijers'*, Levi C. M. Buil?, Jelle Zijlstra', Joachim F. Oude Vrielink!, Rui Lopes!, 
Alejandro Pineiro Ugaldel, Pieter van Breugel', Ingrid Hofland*, Jelle Wesseling*, Olaf van Tellingen’, 


Axel Bex® & Reuven Agami!® 


Tumour growth and metabolic adaptation may restrict the 
availability of certain amino acids for protein synthesis. It has 
recently been shown that certain types of cancer cells depend on 
glycine, glutamine, leucine and serine metabolism to proliferate 
and survive’. In addition, successful therapies using L- 
asparaginase-induced asparagine deprivation have been developed 
for acute lymphoblastic leukaemia. However, a tailored detection 
system for measuring restrictive amino acids in each tumour is 
currently not available. Here we harness ribosome profiling® 
for sensing restrictive amino acids, and develop diricore, a 
procedure for differential ribosome measurements of codon 
reading. We first demonstrate the functionality and constraints 
of diricore using metabolic inhibitors and nutrient deprivation 
assays. Notably, treatment with L-asparaginase elicited both 
specific diricore signals at asparagine codons and high levels 
of asparagine synthetase (ASNS). We then applied diricore to 
kidney cancer and discover signals indicating restrictive proline. 
As for asparagine, this observation was linked to high levels 
of PYCRI, a key enzyme in proline production’, suggesting a 
compensatory mechanism allowing tumour expansion. Indeed, 
PYCR1 is induced by shortage of proline precursors, and its 
suppression attenuated kidney cancer cell proliferation when 
proline was limiting. High PYCR1 is frequently observed in 
invasive breast carcinoma. In an in vivo model system of this 
tumour, we also uncover signals indicating restrictive proline. We 
further show that CRISPR-mediated knockout of PYCR1 impedes 
tumorigenic growth in this system. Thus, diricore has the potential 
to reveal unknown amino acid deficiencies, vulnerabilities 
that can be used to target key metabolic pathways for cancer 
treatment. 

In previous work we used ribosome profiling to map changes in 
translation of wild-type and disease-triggering mutant mitochon- 
dria’. A mitochondrial disease-promoting tryptophan transfer RNA 
(tRNA) mutation elicited specific pausing of mitochondrial ribosomes 
at tryptophan codons and enrichment of ribosome-protected fragments 
(RPFs) covering these sites. We hypothesized that also in the cytosol 
global occupancy of ribosome positions could be used as a cellular 
detector of alterations in the availability of amino acid for protein 
synthesis. To examine this issue we constructed a protocol to meas- 
ure differential ribosome codon reading (diricore), which is based on 
ribosome profiling measurements. We performed two complementary 
analyses, subsequence and 5/-end density (Fig. 1a). The positions 9, 
12 and 15 from the 5’-end of the RPFs correspond to the E (exit), 
P (peptide bond), and A (tRNA recruitment and reading codon) sites 
of the ribosome’. 


To evaluate our approach we first examined ribosome profiling 
performed on yeast treated with 3-amino-1,2,4-triazole (3-AT), an 
inhibitor of histidine synthesis’. As expected, subsequence analysis 
resulted in strong signals in the two histidine-codons at the reading 
codon (position 15, Fig. 1b). RPF density analysis of histidine codons, 
but not control phenylalanine, showed a vast increase at position 15 
(Fig. 1c). Out-of-frame and randomization analyses indicated a signif- 
icant and robust signal (Fig. 1b and Extended Data Fig. 1a). Second, we 
tested harringtonine, an agent that immobilizes initiating ribosomes 
and depletes elongating ones!!!?, in human cells. As expected, diricore 
detected a significant and robust increased signal only at the initiator 
ATG codon (Fig. 1d, e and Extended Data Fig. 1b). This signal was at 
the P-site (position 12), coinciding with harringtonine’s function in 
blocking elongation. Third, we analysed by diricore the response of 
cancer cells to asparagine deprivation using L-asparaginase treatment. 
This uncovered a specific signal at asparagine codons in the treated cells 
(Fig. 1f), which was highly reproducible and statistically significant 
(Extended Data Fig. 2a, b). Out-of-frame and randomization analyses 
showed that only asparagine codons at position 15 give a true signal, 
excluding bias in the data (Extended Data Figs 2c and Ic). Interestingly, 
in response to L-asparaginase the expression of the enzyme responsi- 
ble for asparagine synthesis in the cells (ASNS) increased ~fourfold 
(Fig. 1g, Extended Data Fig. 2d), providing evidence for a feedback loop 
activated in response to asparagine deprivation. To verify the aspar- 
agine-specific signal by diricore we performed tRNA-aminoacylation 
measurements!? (Extended Data Fig. 2e). This analysis indicated a 
significant increase in the non-aminoacylated forms of the asparagine 
(Asn)-tRNAs GTT and ATT, but not of the controls leucine (Leu)- 
tRNA CAG and valine (Val)-tRNA TAC (Fig. 1h). As the total levels of 
Asn-tRNAs did not change following asparaginase treatment (Extended 
Data Fig. 2f), our results link asparagine availability with Asn-tRNA- 
aminoacylation and the emergence of asparagine codon signals in dir- 
icore. Altogether, we provide here evidence indicating that diricore is 
able to detect specific signals of amino acid deficiencies. 

Next, we tested diricore in conditions of vast global translation 
changes using basal and luminal breast cancer cell lines (MDA-MB-231 
and Hs587T, and T47D and MCE’, respectively). While basal cells are 
highly sensitive to glutamine deprivation, luminal cells are relatively 
resistant owing to sustained control of translation elongation by main- 
taining inhibitory phosphorylation at threonine 56 of the key elon- 
gation factor eEF2 (refs 14, 15 and Fig. 2a, b). Following glutamine 
deprivation, diricore revealed markedly reduced signals in the initiator- 
ATG codon position 12 of the basal cells, while little signal was 
observed in the luminal ones (Fig. 2c). In contrast, diricore signals 
at glutamine codons appeared only in the deprived luminal cells. 
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Figure 1 | Ribosome positions as readout for amino acid availability 
for protein synthesis. a, A scheme of the protocol for differential 
ribosome codon reading (diricore). Subsequence analysis examined 

the codons at 9, 12 and 15 nucleotides from the 5’-ends of RPFs. RPF 
density analysis examined every instance of a codon throughout the 
transcriptome to inspect the local RPF 5’-end density surrounding that 
codon. While the latter makes use of all RPFs, subsequence analysis takes 
into account only those that are in frame with the coding sequence. 

A detailed overview is available in Extended Data Fig. 1g, h. b, c, Diricore 
analysis of data obtained from ref. 10 of yeast cells treated with 3-amino- 
1,2,4-triazole (3-AT), an inhibitor of His production. +P < 1 x 107°°° for 


Thus, diricore signals at the initiator-ATG codon can be used to esti- 
mate global changes in translation, and specific codon signals appear in 
conditions where strong global inhibition of translation had not arisen. 

Next, we examined whether diricore could detect restrictive amino 
acids in cancer. We obtained directly from surgery a kidney with clear 
cell renal cell carcinoma (ccRCC), and sampled two normal and four 
tumour tissues for ribosome profiling (Extended Data Fig. 3a). Quality 
control of the ribosome profiling indicated high and comparable quality 


CAC and CAT His-codons by out of frame analysis. d, e, Diricore 
analysis of the SUM1315 cells treated with harringtonine, an inhibitor of 
initiation of translation!»’?. +P <1 x 10~°° for the ATG codon by out 

of frame analysis. f, Diricore analysis of PC3 cells treated for 48 h with 
L-asparaginase (ASNase). tP= 1.05 x 10~° for AAT and P= 1.11 x 10~4 for 
AAC Asn-codons by out of frame analysis. g, Ribosome profiling analysis 
shows overexpression of the ASNS gene, a key component in the synthesis 
of asparagine. h, Analysis of the relative levels of non-aminoacylated 
tRNAs in PC3 cells treated for 48 h with L-asparaginase. For h, error bars 
represent s.d.; n = 3 independent biological replicates; **P < 0.01 by 
Student's t-test. 


of all samples, and gene expression analysis verified ccRCC (Extended 
Data Fig. 3b-d). Notably, diricore analysis comparing tumour to nor- 
mal tissues revealed two consistent, statistically significant, and robust 
signals in methionine and proline codons (Fig. 3a and Extended 
Data Figs 4a, b and 1d). Control comparisons showed no signal, and 
other codons (for example, alanine) were either not consistent in all 
tumours or did not show signal in the 5’ density plots (Extended Data 
Fig. 4c, d). 


a b MCF7 T47D MDA-MB-231 __ Hs587T Figure 2 | Estimation of global changes in 
Gnh: 08 14 3 0 14 38 0 141 8 0 14 8 translation and specific codon signals by 
j diricore. a, Normalized cell growth of the 
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(Thr56) -~ a _ “ -~ ~ indicated breast cancer cell lines at different 
—=— T47D : : 4 
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eEF2 bane a aap ee ee taken 48h after plating. peEF2, phosphorylated 
—e— MDA-MB-231 eEF2. b, Immunoblot analysis of cells grown in 
—e— Hs578T ; complete media or under glutamine starvation 
c Luminal cell lines Basal cell lines for the indicated times. c, Diricore analysis of cell 
+0 T47D MCF7 MDA-MB-231 Hs578T lines deprived of glutamine for 48 h. For a, values 
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Figure 3 | Diricore detects limitations in proline availability in ccCRCC. levels were measured by reverse transcription followed by quantitative 


a, Samples were subjected to diricore analysis. Shown is the comparison 
tumour 1 versus normal 2 (T1 vs N2), while in Extended Data Fig. 4a 
additional comparisons are shown. Highlighted are the tumour-specific 
signals at the methionine and proline codons. Combined statistical 
analysis of all T vs N comparisons is shown in Extended Data Fig. 4b. 
Asterisk in the right panels marks an artificial signal from a 5’ bias in the 
ligation/amplification step. b, Tumour and normal kidney tissue were 
subjected to Pro-tRNA and control Leu-tRNAs aminoacylation analysis. 
c, Expression analysis of PRODH2 in ccRCC tumours. Differential 
expression test was done comparing ccRCC to normal samples using 
the R-package limma. d, Expression analysis shows high PYCR1 levels 
in ~4% of tumours (marked red). e, Global change in codon abundance 
(codon demand) was performed by calculating the relative number of 
proline codons in the normal and tumour sections using RNA-seq data 
from the same samples as in a. f, Analysis of translational efficiency 
changes (ATE) vs proline codon content of genes. g, The cCRCC A498 
cell line was deprived of glutamine for 24 and 48 h, and PYCRI mRNA 


Intriguingly, the strong signal in methionine originated solely from 
the initiator-ATG codon, suggesting global alteration in translation 
rates (Fig. 3a and Extended Data Fig. 4b). The position of the peak 
(P-site) suggests that increased translation initiation rate in the tumour 
is the underlying mechanism. A relative increase in initiation of transla- 
tion, in conditions when the initiation to elongation step becomes rate 
limiting, would generate relatively more ribosomes with the initiator- 
ATG codon at their P-site. Supporting evidence for this scenario is the 
strong elevation of phospho-4EBP1, a canonical marker of initiation 
of translation, in the tumour compared to normal tissue (Extended 
Data Fig. 5a). Also the strong depletion of the signal at the P-site of the 
initiator- ATG following short inhibition of mTOR (an upstream kinase 
of 4EBP) or nutrient starvation supports this notion (Extended Data 
Fig. 5b). Thus, diricore identified a signal at the initiator-ATG codon 
that suggests enhanced global translation initiation in the tumour. 

In contrast to the initiator-ATG, the signal in proline codons 
appeared at both positions A and P sites, suggesting limiting availa- 
bility of proline (Pro)-tRNA for protein synthesis (Fig. 3a). The rel- 
ative higher level of non-aminoacylated, but not total Pro-tRNAs in 
the tumour compared with normal tissue supports this suggestion 
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PCR (qRT-PCR) and normalized to GAPDH. h, A498 cells were depleted 
from glutamine, and then supplemented with proline in the indicated 
concentrations. PYCR1 levels were measured by qRT-PCR and normalized 
to GAPDH mRNA. i, Diricore analysis of a second ccRCC tumour. 

+P <3.4 x 10-8 for the ATG codon by out of frame analysis. j, Pro-tRNA 
and control Leu-tRNA aminoacylation analysis of tumour and normal 
tissues. k, Pro-tRNA aminoacylation analysis was performed with the 
786-O ccRCC cell line subjected to proline deprivation and PYCR1 
knockdown using short interfering RNA (siRNA), as indicated. 1, The 
same experimental setup as in k. Cells were counted 72 h after treatment 
to assess proliferation. For c and d, The box indicates the 25th/50th/75th 
percentiles (bottom/middle/top bar), the extent of the whiskers indicate 
the most extreme values that are within 1.5 times the IQR (inter-quartile 
range) of the hinge (as per R package ggplot2/geom_boxplot). For b, g, h, 
1, j, and k error bars represent s.d.; n = 3 independent biological replicates; 
*P < 0.05; **P < 0.01 by Student's t-test. Cutoff for RPF density shift plots 
in i was 15 reads per window. 


(Fig. 3b, Extended Data Fig. 4e). Intriguingly, like L-asparaginase treat- 
ment (Fig. 1g), gene expression analysis of proline metabolic enzymes 
revealed vast upregulation of PYCR1, a key enzyme that catalyses the 
last step in proline synthesis’, and also strong downregulation of the 
proline catabolic enzymes PRODH2 and ALDH4A1 (Extended Data 
Fig. 3d). While loss of PRODH2 expression is commonly seen in kidney 
tumours'®, PYCR1 activation is infrequent (Fig. 3c, d). We confirmed 
upregulation of PYCR1 in this tumour by immunohistochemistry 
(Extended Data Fig. 5a). The proline signal in diricore and the changes 
in proline metabolic genes suggest a compensatory feedback mecha- 
nism in the kidney cancer cells to enable proliferation in conditions of 
relative proline shortage. Intriguingly, two other PYCR1-high kidney 
tumours showed similar diricore proline pattern and increased Pro- 
tRNA non-aminoacylation (Extended Data Fig. 6), strengthening the 
link between high PYCRI and proline shortage. 

Proline deficiency in the tumour cells can be a result of increased 
proline demand or decreased proline production, for example due to 
lack of proline precursors. Codon demand measurements revealed 
no global increase in the tumours, even when analysis was restricted 
to differentially expressed genes or those exported extracellularly, 
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Figure 4 | Diricore uncovers tumour-specific proline vulnerability. 
a, PYCR1 expression analysis in invasive ductal breast carcinoma (IDBC) 
compared to normal breast. The box indicates the 25th/50th/75th 
percentiles (bottom/middle/top bar), the extent of the whiskers indicates 
the most extreme values that are within 1.5 times the interquartile 
range (IQR) of the hinge (as per R package ggplot2/geom_boxplot). 
b, Schematic diagram showing the generation of cancer cell lines 
expressing the eGFP-RPL10a fusion protein, the isolation of GFP-tagged 
ribosomes from tumours and cells in culture by immunoprecipitation 
(IP), and the experimental design. Ribo-seq denotes ribosome profiling. 
c, d, Diricore analysis of tumours derived from eGFP-RPL10a-expressing 
SUM159PT cells. +Out-of-frame analysis P< 3.15 x 107°, P< 3.13 x 10-3, 
P<5.31 x 107, and P< 1.8 x 107! for the CCC, CCT, CCA and CCG Pro 
codons, respectively. +Out of frame analysis P< 3.9 x 10-°, P< 9.4 x 1073, 
P<1.17 x 107, and P< 1.0 x 107! for the CCC, CCT, CCA and CCG 


or when rate of translation was considered (Fig. 3e, fand Extended 
Data Fig. 5c). We therefore investigated the regulatory events lead- 
ing to PYCRI activation, rationalizing that this can be a clue to the 
proline deficiency. Particularly, we asked whether limiting amount of 
glutamine, the precursor of proline (Extended Data Fig. 3d), or low 
proline levels, is causal to activation of PYCR1. We selected A498, a 
ccRCC cell line with moderate PYCR1 expression (Extended Data 
Fig. 5d), depleted glutamine, and measured PYCRI messenger RNA 
levels. This analysis revealed a strong and significant activation 
of PYCR1, but not its paralogues, following glutamine deprivation 
(Fig. 3g and Extended Data Fig. 5e). However, addition of proline 
to the glutamine-depleted cells did not attenuate PYCR1 activation 
(Fig. 3h), suggesting PYCRI1 responsiveness to a precursor substrate in 
the metabolic pathway from glutamine to proline. 

Interestingly, when we examined an additional ccRCC tumour by 
diricore, a specific, significant and robust signal was identified in the 
initiator- ATG, which resembled the signal observed in the first tumour, 
but not at proline codons (Fig. 3i and Extended Data Fig. le). Also in 
this tumour, the signal in the initiator-ATG correlated with global acti- 
vation of translation initiation (Extended Data Fig. 5f). In contrast, the 
lack of proline signal suggested sufficient proline. Indeed, this tumour 
presented no increase in non-aminoacylated Pro-tRNA levels, no 
change in total Pro-tRNA levels, and no upregulation of PYCR1 or 
any other related gene (Fig. 3j and Extended Data Figs 4f and 5f, g). 
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Pro codons, respectively. NS, not significant by out-of-frame analysis. 

e, Immunoblot analysis of SUM159PT cell clones expressing CRISPR- 
Cas9 with either a non-targeting (control) or two independent single- 
guide RNAs (sgRNAs) against PYCR1 (knockout, KO). f, g, The same 
SUMI159PT clones as in e were propagated either in vitro or in vivo, and 
cell proliferation and tumour volume (n= 6) were monitored in time. 

h, The same SUM159PT cell clones as in e were deprived of glutamine 
and grown with additional proline, as indicated. The number of cells 
was counted 48 h after treatment. i, Analysis of the relative levels of 
non-aminoacylated tRNAs was performed with the same cell clones 

as in e deprived of glutamine for 18 h. Error bars represent s.d.; n= 3 
independent biological replicates; *P < 0.05, ***P < 0.001 by Student's 
t-test. For g, n refers to the number of xenografts per condition. NS, non 
significant. Cutoff for subsequence shift plots in c was 50 reads per gene. 


Immunohistochemistry with CD31, a marker for blood vessels, 
revealed higher vascularity of this tumour compared with the first, 
providing a potential evidence for efficient nutrient supply and 
an explanation for the lack of diricore proline signal (Extended Data 
Fig. 5h). 

Next, we tested whether interference with the proline/PYCRI reg- 
ulatory pathway can affect cellular proliferation, and used 786-O, a 
ccRCC cell line with high PYCR1 (Extended Data Fig. 5d), suggestive of 
proline shortage vulnerability. We knocked down PYCRI, and cultured 
cells in medium containing dialysed serum with or without proline to 
mimic proline shortage (Extended Data Fig. 5i). Indeed, PYCR1 knock- 
down increased non-aminoacylated Pro-tRNA levels, and addition of 
proline attenuated this effect (Fig. 3k). Notably, PYCR1 knockdown 
reduced proliferation, which was negated by proline addition (Fig. 31), 
supporting the assumption that interfering with intracellular proline 
production attenuates proliferation when proline availability is limiting. 

High PYCRI1 is commonly observed in invasive ductal breast car- 
cinoma (IDBC, Fig. 4a), possibly indicating proline shortage when 
grown in vivo. To test this hypothesis, we generated SUM159PT/ 
eGFP-RPL10a, a PYCR1-high IDBC cell line (Extended Data 
Fig. 7a) expressing enhanced green fluorescent protein (eGFP)-tagged 
ribosomal protein RPL10a. As expected'’, eGFP-RPL10a protein 
was incorporated into polysomes to be used for ribosome-isolation 
using anti-GFP antibodies (Extended Data Fig. 7b, c). Quality control 
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experiments indicated very similar profiles as the protocol we rou- 
tinely use for ribosome profiling (Extended Data Fig. 7d, e), enabling 
us to apply diricore to tumours grown in vivo. We then either trans- 
planted SUM159PT/eGFP-RPL10a cells into immuno-compromised 
mice, or cultured them ex vivo (Fig. 4b). Following expansion, cells and 
tumours were harvested for immunoprecipitation and diricore analysis. 
Intriguingly, comparing tumours with cultured cells indicated specific 
signals in all four proline codons position 15, three of which were sig- 
nificant (Fig. 4c, d). Randomization analysis confirmed robust signal 
(Extended Data Fig. 1f). Control comparisons produced no signal, 
and an independently repeated experiment produced similar results 
(Extended Data Fig. 8a and b). These results indicate that limiting levels 
of proline are experienced by SUM159PT during tumorigenic devel- 
opment in mice. 

As diricore signals in proline were linked to PYCR1 expression levels, 
we interrogated the impact of PYCR1-loss on tumour development 
using two PYCR1 knockouts (KO) and one control SUM159PT mon- 
oclonal cell lines (Fig. 4e). Interestingly, while PYCR1-KO cells prolif- 
erated normally ex vivo (Fig. 4f), their capacity to expand in vivo was 
diminished (Fig. 4g and Extended Data Fig. 9a). To further link pro- 
line deficiency and cell growth, we cultured these cell lines in medium 
with limiting amounts of glutamine, the major precursor of proline. 
While control cells were only relatively mildly affected by glutamine 
deprivation, PYCRI1-KO cells showed a greater sensitivity (Fig. 4h). 
Importantly, proline addition improved proliferation of PYCR1-KO, 
but not control, cells following glutamine deprivation. We confirmed 
increased non-aminoacylation of Pro-tRNAs following glutamine dep- 
rivation, augmentation of this effect in PYCR1-KO cells, and its atten- 
uation following proline addition (Fig. 4i). Thus, in growth limiting 
cellular environments, such as glutamine deprivation and expansion 
in vivo, PYCR1 expression is required for extensive proline production 
to maintain tumorigenic growth (Extended Data Fig. 10). 

Altogether we show that ribosome profiling can sense amino acid 
shortages in cells and tumours. Although approaches to directly meas- 
ure amino acids in tumours and bodily fluids exist, they are unlikely 
to yield biologically meaningful results as tumour demand, energy 
requirement, and genetic makeup can differ greatly. Our experiments 
uncovered proline codon signals in tumours. Proline signals in transla- 
tion can be caused by the unique and constrained shape of proline!*!® 
which is resolved by the translation factor elF5A?”"!. However, the 
increased levels of eIF5A in kidney tumours regardless of their pro- 
line signature, and the lack of a higher signal at poly-proline motifs 
(Extended Data Fig. 9b, c), disfavour such a possibility. In contrast, 
tRNA-aminoacylation assays indicated proline shortage as the cause of 
the proline signal. The link we uncovered between proline codon sig- 
nals and elevated proline production pathway indicated compensatory 
mechanism and vulnerability for tumour expansion. Indeed, reducing 
cellular proline production impaired tumorigenic growth, providing 
indications for the usage of ribosome profiling for detecting tumour 
vulnerabilities and for rational targeting proline for cancer therapy. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 
Cell culture. SUM1315MO2 cells were cultured in DMEM/F12 1:1 medium 
supplemented with 5% fetal bovine serum (FBS), EGF (10 ng ml7!), and insulin 
(5 pgml~) in 5% CO, at 37 °C. SUM159PT cells were cultured in DMEM/F12 
1:1 medium supplemented with 5% FBS, insulin (5 j1g ml), and hydrocortisone 
(1 pg ml~!). PC3, 786-O, and A498 cells were cultured in RPMI medium supple- 
mented with 10% FBS. The breast cancer cell lines MCF7, T47D, MDA-MB-231, 
and Hs578T were cultured in DMEM medium supplemented with 10% FBS. All 
cell lines were obtained from ATCC, except for SUM159PT and SUM1315MO2, 
which were obtained from Asterand Bioscience. Cell lines were regularly tested for 
Mycoplasma contamination. Cell lines were authenticated by expression analysis 
based on RNA-seq and/or ribosome profiling. For glutamine starvation experi- 
ments, cells were cultured for 48h in glutamine-free DMEM supplemented with 
10% of dialysed FBS. 

siRNAs against PYCR1 were purchased from Life Technologies (PYCR1 siRNA 
catalogue number 4390824, ID s194735; Negative control siRNA catalogue num- 
ber 4390844). Cells were transfected using Dharmafect I reagent (Dharmacon) 
following the manufacturer's instructions. 
Immunoblotting. Immunoblots were performed as described previously*. The 
following antibodies and dilutions were used, anti-GFP (Santa Cruz Biotechnology, 
sc-8339, 1:1,000), anti-RPL10a (Santa Cruz Biotechnology, sc-100827, 1:1,000), 
anti-RPS6 (Cell Signaling Technology, 2217, 1:1,000), anti-RPL13a (Cell Signaling 
Technology, 2765, 1:1,000), anti-PYCR1 (Proteintech, 13108-1-AP, 1:1,000). 
Immunohistochemistry. Immunohistochemistry of the FFPE tumour samples 
was performed on a BenchMark Ultra autostainer (Ventana Medical Systems). 
Briefly, paraffin sections were cut at 3 1m, heated at 75 °C for 28 min and depa- 
raffinized in the instrument with EZ prep solution (Ventana Medical Systems). 
Heat-induced antigen retrieval was carried out using Cell Conditioning 1 (CC1, 
Ventana Medical Systems) for respectively 32, 36 and 64 min at 95 °C (CD31, 
phospho-4E-BP1 and PYCR1). CD31 was detected using clone JC/70A (1:300 dilu- 
tion, 32 min room temperature, ThermoScientific), Phospho-4E-BP1 with clone 
236B4 (1:750 dilution, 1 h room temperature , Cell Signaling), and PYCR1 using a 
polyclonal antibody (1:4,000 dilution, 1 h room temperature , ProteinTech). Bound 
phospho-4E-BP1 was detected using the Ultra View Universal DAB Detection Kit 
(Ventana Medical Systems), while detection for CD31 and PYCR1 was visualized 
using the OptiView DAB Detection Kit (Ventana Medical Systems). Slides were 
counterstained with haematoxylin. 
Ribosome profiling (Ribo-seq). Libraries from cultured cells were prepared as 
described previously”. For ribosome profiling from tissue, samples were snap 
frozen after surgical removal and lysed mechanically using a tissue homoge- 
nizer in the presence of ice- cold lysis buffer (20 mM Tris-HCl, pH 7.8, 100 mM 
KCl, 10 mM MgCl2, 1% Triton X-100, 2 mM DTT, 100 pg ml“! cycloheximide, 
1x EDTA-free Complete protease inhibitors). Lysates were centrifuged at 
5,000 r.p.m. and the supernatant was digested with 2 Ul! of RNase I (Life 
Technologies) for 45 min at room temperature. Resulting monosomes were 
purified, RNA was isolated, and ribosome profiling libraries were prepared as 
described previously. Studies with patient material were approved by the Medical 
Ethical Committee of The Netherlands Cancer Institute. An informed consent was 
obtained from all the subjects. 
eGFP-RPL10a immunoprecipitation. SUM-159 eGFP-RPL10a cells were treated 
with cycloheximide at a final concentration of 10j1gml” ‘for 5 min. Cells were 
washed with cold PBS, scraped, and pelleted. Cell pellet was lysed in 1 ml of NP40 
lysis buffer (20 mM Tris-HCl, pH 7.8, 10 mM MgCl2, 150 mM KCI, 1% NP40, 
2mM DTT, 1x Complete protease Inhibitors, 100 ug ml! CHX) for 15 min, cen- 
trifuged for 10 min at 1,300g and 1 ml of supernatant was recovered. For tumours, 
each sample was lysed in 1 ml of NP40 lysis buffer in a tissue homogenizer. Tumour 
lysate was centrifuged at 1,300g for 10 min and the supernatant was recovered. 
85 il per condition of GFP-Trap_M beads (Chromotek) were washed for 3 times in 
1 ml of NP40 lysis buffer, resuspended in 3 ml of NP40 lysis buffer. 1 ml of cleared 
lysate (cell line or tumour) was added to the beads and digested with RNase I 
(100 Ul?) for 1 h at room temperature under constant rotation. Beads were 
washed 3 times with 1 ml of NP40 Lysis buffer and 3 times with NP40 wash buffer 
(20 mM Tris-HCl pH 7.8, 10 mM MgCh, 350 mM KCl, 1% NP40, 2 mM DTT, 
1x Complete protease Inhibitors, 100 j1gml-' CHX). Beads were resuspended in 
300 il of lysis buffer with 1% SDS and 15 \1l of PCR-grade proteinase K (Roche) 
and incubated for 1 h at 45 °C. Supernatant was recovered and resuspended 
in TriSure. RNA was isolated and libraries prepared according to the ribosome 
profiling protocol. 
tRNA aminoacylation. Cells were harvested in cold PBS, centrifuged, and resus- 
pended in 0.3 M sodium acetate/acetic acid (NaOAc/HOAc) (pH 4.5). Total 
RNA was isolated using acetate-saturated phenol/CHC]; (pH 4.5). Precipitated 
RNA was resuspended in 10 mM NaOAc/HOAc (pH 4.5). Samples were split in 
two, one half (2 jug) was oxidized with 50 mM NalO, in 100 mM NaOAc/HOAc 
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(pH 4.5) for 15 min and the other half (2 jug) was incubated in 50 mM NaCl in 100 
mM NaOAc/HOAc (pH 4.5) for 15 min. Samples were quenched with glucose 100 
mM for 5 min at room temperature, purified in G50 columns (GE Healthcare), 
and then precipitated with ethanol. tRNAs were deacylated in 50 mM Tris-HCl 
(pH 9) for 30 min at 37 °C. RNA was precipitated and then ligated to the 3/adaptor 
(5'-/5rApp/TGGAATTCTCGGGTGCCAAGG/3ddC/-3’) using T4 RNA ligase 2 
(NEB) for 2 h at 37 °C. Relative aminoacylation levels were calculated by gRT-PCR 
using tRNA specific primers. 

Reverse primer, GCCTTGGCACCCGAGAATTCCA; Pro-tRNA(AGG) primer, 

GGCTCGTTGGTCTAGGGGTATG; Pro-tRNA(TGG) primer, CAGAACTAT 
AGGAATTGAACCTA; Pro-tRNA(CGG) primer, GTTGGTCTAGGGGTATGAT 
TCTCGCTTC; Leu-tRNA(CAA) primer, GTCAGGATGGCCGAGTGGTCTA; 
Leu-tRNA (CAG) primer, G(CAGGATGGCCGAGCGGTCTA,; Val-tRNA 
(TAC) primer, GGTTCCATAGTGTAGTGGTTAT; Asn-tRNA(GTT) primer, 
GTCTCTGTGGCGCAATCGGT. 
Real-time PCR. 1 j1g of total RNA was reverse transcribed using the SuperScript 
III first-strand synthesis system (Life Technologies) following the manufacturer's 
instructions. Real-time PCR was performed using the SensiFAST SYBR real time 
PCR kit (Bioline). 

The following primer sequences were used, PYCR1 forward primer, CATG 
ACCAACACTCCAGTCG; PYCRI reverse primer, CCTTGGAAGTCCCATC 
TTCA; PYCR2 forward primer, ATAGCCAGCTCCCCAGAAAT; PYCR2 
reverse primer, CTCCACAGAGCTGATGGTGA; PYCRL forward primer, 
GCCTCATCAGAGCAGGAAAA; PYCRL reverse primer, CACGGACACCAA 
GATGTGTT; GAPDH forward primer, ACCCAGAAGACTGTGGATGG; 
GAPDH reverse primer, TCTAGACGGCAGGTCAGGTC. 

Deep sequencing data preprocessing and alignment. Adaptor sequences were 
trimmed from raw ribosome profiling data using cutadapt”*. Sequences shorter 
than 20 bp after adaptor trimming were discarded. rRNA and tRNA sequences 
were filtered by alignment to indices of rRNA and tRNA sequences respectively, 
using bowtie2*4 and default parameters. A rRNA index was constructed from 
GENCODE v19 annotations, transcript types “rRNA? “Mt_rRNA’ and “rRNA_ 
pseudogene’, supplemented with UCSC repeats of class “rRNA” (table “rmsk”). 
The tRNA index was constructed from sequences obtained from GtRNAdb” at 5 
September 2012. The remaining (unmapped) reads were aligned to GRCh37/hg19 
using TopHat2”° and GENCODE v19/BASIC” transcript coordinates (parameters 
“-N 2 -m 1-no-novel-juncs-no-novel-index-no-coverage-search-segment-length 
25”). In subsequent analyses, only primary alignments with mapping quality of 
10 or greater were considered. Ribosome profiling and RNA-seq data followed the 
same pipeline, with the exception of adaptor trimming, which was only performed 
on ribosome profiling data. Data presented in this manuscript is available with 
GEO accession number GSE59821. 

Subsequence analysis. In summary, subsequence shift analysis compares RPF 
codon occupancy frequencies between samples, in a gene-level normalized man- 
ner (to exclude gene expression differences as cause for differences in observed 
codon frequencies). 

Specifically, gene IDs and reading frames were assigned to RPFs, using 
GENCODE v19/BASIC. RPFs not within a valid CDS (taking into account the 
15-nucleotide 5’-overhang of RPFs), RPFs with ambiguous gene ID and RPFs 
with ambiguous reading frame were excluded. The remaining RPFs were used 
to count, for the different positions (12 and 15 nucleotides from 5’-end), the 
frequencies of all codons, for all genes in the transcriptome. Within-gene fre- 
quencies were calculated by dividing the observed counts by the total counts 
for that gene. Normalized codon frequencies were averaged over all genes that 
had at least n counts in both the condition and control samples (n is 100 for all 
figures, unless otherwise specified). Shifts were calculated from these normalized 
and averaged frequencies per codon as: (condition - reference)/reference. That 
is, per codon shifts were calculated relative to the control. To test for significant 
differential codon occupancy at the amino acid level, we used the subsequence 
shifts between control and conditions over the replicates. We used a linear mixed 
model (R package ‘Ime4’) with fixed effects over the 20 amino acids and random 
effects over the codons. This supplied us with t-values and P-values. We used 
Benjamini-Hochberg multiple testing correction (R function ‘p.adjust, method 
“fdr”) to extract adjusted P-values. For schematic representation of the method 
see (Extended Data Fig. 1g). 

Code availability. Code is available upon request. 

RPF density analysis. Codon-regions of 61 nucleotides width around designated 
codons along the transcriptome (as annotated by GENCODE v19/BASIC) were 
identified (using transcript coordinates; that is, codon-regions were all supported 
by exons). Overlapping transcript annotations were used, and overlapping regions 
were retained, but regions with identical genomic coordinates were collapsed. 
Codon-regions that could not be extended to 61 nucleotides (due to being near 
the 5’ or 3’ ends of transcripts) were discarded. 
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5! ends of RPFs were counted for each codon-region. For a comparison between 
two samples, only codon-regions where the total number of counts was at least n in 
both samples were taken into consideration (n was 50 for all figures, unless other- 
wise specified. n was lowered in some data sets to include at least 1,000 windows for 
common codons). For each sample and each codon-region, the normalized 5’ end 
RPF density was calculated by dividing over the total number of counts within that 
region, and multiplying by the width of the codon-region, so the average density 
within each codon region is equal to 1. The normalized densities were convolved 
using a rectangular window of width 3 and height 1/3. The mean density over the 
codon-regions was taken and the difference in the mean densities between the 
samples yielded to the density shift. 

Out-of-frame analysis. We performed out-of-frame analysis to ensure that the 
signals we observed in the cases where we have no replicate samples (3-AT main 
text Fig. 1b, harringtonine main text Fig. 1d, second ccRCC tumour from main 
text Fig. 3i) are robust and significant. A background distribution of values was 
generated using —1/+1 nucleotide. Shifts from the analysed codon. From this 
background distribution we excluded values implicitly caused by real signal (for 
example, for harringtonine, which causes a signal at ATG in the 12th position, we 
used the nucleotide triplets observed in the 11th and 13th position, but excluded 
NAT and TGN at the 11th and 13th position, respectively). We then used this 
background distribution to measure the significance of the diricore signal in the 
original (+0) position using Z-tests (we ensured that values in the background 
distribution were approximately normally distributed using visual inspection of 
q-q plots, the Anderson-Darling test and the Shapiro-Wilk test). 

Diricore analysis of randomized data sets. Aligned reads were displaced by 
—5 to +5 nucleotides. Since alignments are done against the genome transcript 
coordinates/introns were taken into account and reads were effectively displaced 
along the transcript. In case of inconsistent transcript isoforms, leading to ambig- 
uous location after displacement, reads were discarded (this occurred with less 
than 1% of the reads). For every sample, three independent randomizations were 
performed. Only reads in the CDS and in-frame with the CDS (before displace- 
ment) were taken into account (for details, see Methods section Subsequence 
analysis). For further diricore analysis, we used all reads post displacement, with 
all other methods identical (see sections Subsequence analysis and RPF density 
analysis). 

Normalized abundances and translational efficiencies. Gene-wise counts were 
performed using annotations from GENCODE v19/BASIC and HTSeq. Counts 
were normalized by correction for the total library sizes using trimmed-mean of 
M-values (TMM) normalization, supplied by the R-package ‘edgeR’ Translational 
efficiencies were calculated as the ratio of (normalized abundance determined by 
ribosome profiling)/(normalized abundance determined by RNA-seq). 

GSEA. To obtain a RankedList as input for GSEA”’, gene-wise counts (generated 
using GENCODE v19/BASIC and HTseq) were normalized using edgeR's*” TMM 
normalization. Predictive fold-changes were obtained using edgeR's predFC rou- 
tine with a prior count of 3. Ensembl/GENCODE gene IDs were converted to 
Entrez gene IDs using a look-up table obtained from Ensembl MartView. Fold- 
changes of genes that mapped to a single Entrez gene ID were averaged. In GSEA, 
the predictive fold-changes were used a scores. The gene sets analysed contained 
all MSigDB/C2 (v4.0) gene sets. Default settings were used, with the exception 
that 10,000 permutations were used to estimate the significance of enrichments. 
Codon demand calculations. Codon demand was calculated by calculation of the 
absolute abundances of each codon in the transcriptome, as determined by RNA-seq. 
Reads per gene ID were counted using GENCODE v19/BASIC annotations 


and HTSeq. Gene-wise abundances were normalized by the total library sizes. 
Per-gene isoform abundances were estimated using MISO*”. The total numbers 
of codons present per transcript were calculated using the CDS annotations. Shifts 
in codon demand were calculated relative to the reference, that is, the shifts were 
calculated as (condition — reference)/ reference. 

Processing of yeast data. Yeast ribosome profiling data of cells treated with 3-AT?° 
was obtained from GEO (samples GSM1279568 and GSM1279579). Adaptor 
sequences were trimmed using cutadapt”*. Sequences shorter than 20 bp after 
adaptor trimming were discarded. rRNA and tRNA sequences were filtered by 
alignment to indices of rRNA and tRNA sequences respectively. A rRNA index 
was constructed from Ensembl annotations (EF4.69), transcript type “rRNA”). 
Similarly, a tRNA index was constructed from Ensembl annotations (EF4.69), 
transcript type “tRNA”). Remaining (unmapped) reads were aligned to the 
yeast genome (EF4.69) using corresponding Ensembl transcript annotations 
and TopHat2” (using parameters “-N 2 -m 1-no-novel-juncs—no-novel-index- 
no-coverage-search-segment-length 25”). In subsequent analyses, only primary 
alignments with mapping quality of 10 or greater were considered. 

Animal studies. The Animal Ethics Committee of The Netherlands Cancer 
Institute approved all the experiments in this study. 5—7-week-old, female 
NOD-SCID IL2Rg null (Charles River) were used in all the experiments. 
5 x 10° cells were injected in the fat pads and tumour growth was measured over 
time. Experiments in Fig. 4 were repeated twice at independent times. The values 
reported represent the mean of 6 xenografts. Measurements were taken every 
3-5 days and tumour volume was estimated by 0.5 x W x W x L, where W is 
the width and L is the length of the tumour. None of the experiments exceeded 
the limit of the maximum tumour allowed by the NKI institutional guidelines. 
Maximum tumour size allowed was 1,000 cubic millimetres. Distributions were 
assumed to follow a Student's t distribution. No animals were excluded from the 
analysis and they were assigned to groups by simple randomization. Experiments 
were not conducted blind. No statistical methods were used to predetermine 
sample size. 

CRISPR-cas9-mediated knockout. PYCR1 was targeted using the lenti- 
CRISPR-V2 and sgRNAs to TCTCCGGACAGCATGAGCGT (KO1) and 
CTTCATCGGCGCTGGCCAGC (KO3) sites. 
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Extended Data Figure 2 | Supporting evidence for the asparagine 
signal in diricore. a, Diricore analysis of PC3 cells treated for 48 h with 
L-asparaginase reveals a specific signal in asparagine codons at position 
15, two replicates are shown. b, Subsequence shift data from Fig. 1f, 

and a, in which significant shifts at the amino-acid level are displayed 
(***FDR < 0.001). c, Subsequence shift at in-frame and out-of-frame 
positions (8-21 nucleotides from RPF 5’-ends). d, Volcano plot showing 
gene expression changes (as determined by ribosome profiling) upon 
ASNase treatment. The horizontal axis shows log-fold-changes, while the 


Asn-GTT Asn-ATT Leu-TAG Val-TAC 


vertical axis shows the statistical significance of the differential expression. 
Blue dots indicate genes annotated with GO term “cellular amino 

acid metabolic process” (GO:0006520), while red dots indicate genes 
annotated with GO term “asparagine metabolic process” (GO:0006528). 

e, Schematic diagram of a protocol based on Zaborske et al. 13 to compare 
aminoacylated and non-aminoacylated tRNAs by qRT-PCR. f, Relative 
total tRNA levels from Fig. 1h, Leu-CAG tRNA was used to normalize 

the values. The error bars in f represent s.d.; n = 3 independent biological 
replicates; NS, non significant by Student's t-test. 
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Extended Data Figure 3 | Quality control of ribosome profiling from 
tumour and normal ccRCC tissue, and gene expression analysis of key 
ccRCC tumour markers. a, Left, gross presentation of ccCRCC. Depicted 
areas indicate normal and tumour tissue. Arrows indicate regions where 
the samples were taken. Right, representative microscopic images of 
tumour sections stained with haematoxylin and eosin (H&E). Pathological 
analysis confirmed a clear cell renal cell carcinoma (ccRCC). b, Quality 
control of the Ribo-seq data obtained from Normal and Tumour tissues. 
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Left, correlation of Ribo-seq data from independent samples. Right, 

RPF periodicity from all the samples. c, Gene expression analysis by 
Ribo-seq confirms high level of EGFR, VIM, and CA9 in the tumour 
samples compared with the normal tissues, as was reported for renal 

cell carcinomas (Cancer Genome Atlas Research, 2013). d, Schematic 
diagram of the proline synthesis pathway (upper panel). Changes in gene 
expression in the proline synthesis pathway based on Ribo-seq. 
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Extended Data Figure 4 | Diricore analysis of additional tissue samples 
of ccRCC tumour 1, and a combined statistical analysis of signals at 
positions 12 and 15. a, Diricore analysis as in Fig. 3a. b, Subsequence 
shifts calculated by counting codon occurrences in specific positions 
within RPFs, normalized within and averaged over genes, with respect to 
untreated controls. A mixed linear model, in which amino acids are fixed 
effects and codons are random effects, is used to test for deviating shifts 
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at the amino acid level (***FDR < 0.001, **FDR< 0.01, *FDR<0.1). 

c, Diricore analysis between samples N1 and N2 from normal kidney 
(upper panel) and between samples T2 and T4 (lower panel). d, Diricore 
analysis between samples T1 and N2 does not show a consistent signal 
at Ala codons. e, f, Relative total tRNA levels from Fig. 3b, j, respectively. 
Leu-CAG tRNA was used to normalize the values. For e and f, error bars 
represent s.d.; n = 3 independent biological replicates. 
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Extended Data Figure 5 | Supporting evidence for global stimulation 
of translation in ccCRCC tumours in study. a, Immunohistochemistry 
performed on a section containing tumour and normal tissues (T and 

N, respectively) using anti-p4EBP1 threonine 37/46 and anti-PYCR1 
antibodies. b, Diricore analysis of MCF 10a cells treated with either Torin 1 
or deprived of nutrients for 2 h. Methionine codons at the Start and 

CDS are shown. ¢, Proline codons content was interrogated using 
RNA-seq data from the tumour presented in Fig. 3. The analysis was 
performed with a subset of genes significantly (FDR < 0.05) altered in the 
tumour compared with normal tissue (up or down). The box indicates 
the 25th/50th/75th percentiles (bottom/middle/top bar), the extent of 
the whiskers indicate the most extreme values that are within 1.5 times 
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the IQR (inter-quartile range) of the hinge (as per R package ggplot2/ 
geom_boxplot). d, RT-PCR analysis of the relative expression of PYCR1 
in 786-O and A498 ccRCC cell lines. e, gRT-PCR analysis of the relative 
expression of PYCR2 and PYCRL in the A498 ccRCC cell line after 48h 
of glutamine deprivation. f, Immunohistochemistry analysis detects high 
levels of p4EBP1 threonine 37/46 but no PYCRI signal. g, A scheme of the 
common tumour changes in gene expression. In particular, while down 
regulation of PRODH2 and ALDH4A1 was clearly observed, no strong 
activation of PYCRI was noted. h, Immunohistochemistry analysis using 
anti-CD31 antibodies. i, Knockdown of PYCR1 in 786-O cells expressing 
high levels of PYCR1 (d), expression is normalized to GAPDH mRNA. 
Error bars represent s.d.; scale bar, 100 jum. 
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Extended Data Figure 6 | Supporting evidence for the link between relative levels of non-aminoacylated tRNAs in ccRCC tumours. For d and 
the proline signals in diricore and high PYCR1 expression level. f, error bars represent s.d.; 1 = 3 independent biological replicates; n =3 

a, b, Two additional ccRCC and normal samples were subjected to diricore independent biological replicates; *P < 0.05 by Student’s t-test. Cutoff for 
analysis. Tumour specific signals at the methionine and proline codons are _ subsequence shift plots in this figure was 25 reads per gene. Cutoff for RPF 
highlighted. c, e, Gene expression analysis based on Ribo-seq shows that density shift plots in this figure was 25 reads per window. 

the selected tumours express high levels of PYCRI1. d, f, Analysis of the 
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Extended Data Figure 7 | Usage of eGFP-RPL10a fusion protein for 
ribosome profiling in a high PYCR1 expressing ccRCC cell line. 

a, PYCR1 mRNA levels measured by qRT-PCR in a panel of breast cancer 
cell lines. Values were normalized to GAPDH mRNA. b, Immunoblot 
analysis from sucrose gradients shows the incorporation of the eGFP- 
RPL10a fusion protein in polysomes. c, Immunoprecipitation experiments 
with beads coated with antibodies against GFP followed by western blot 
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analysis in SUM159PT cells expressing eGFP-RPL10a. d, Correlations 
between independent Ribo-seq experiments where ribosomes where 
purified by sucrose gradients (SG) or by GFP immunoprecipitation. MCF7 
eGFP-RPL10a cells were used in this experiment. e, RPF reads profiles 
from Ribo-seq experiments generated from sucrose gradients or GFP 
immunoprecipitation. Error bars represent s.d. 
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Extended Data Figure 8 | Additional tumour vs cultured 
eGFP-RPL10a-expressing SUM159PT conditions. a, b, Diricore 
analysis of an independent experiment of tumours derived from 
eGFP-RPL10a-expressing SUM159PT. A specific signal in proline codons 
at position 15 is observed. 


© 2016 Macmillan Publishers Limited. All rights reserved 


a 
2 30 — Control 
Bae —— PYCR1 KO1 
2 200 —- PYCR1 KO3 
= 
5 100 
fe} 
E 0 
= 5 15 25 30 35 40 
Days 
c 


Tumor 1 vs Cultured 1 
Y_PPG y PPP 


os 


Tumor 2 vs Cultured 2 
y_PPG y PPP 
0s : 


My A 


os H 
30-15 0 15 30 -30 15 0 15 30 


Relative read density 


Position respect to codon 


Extended Data Figure 9 | Additional data supporting a role for proline 
production pathway in tumour growth in vivo. a, Tumour volume of 
clonally derived SUM159PT cells expressing a non-targeting ssRNA 
(control) or two independent sgRNAs against PYCR1 (KO). b, Gene 
expression analysis by RNA-seq shows high levels of eiF5A in the tumour 
samples compared to normal tissues. c, RPF density shift plots at amino 
acid motifs PPG and PPP (left panels) on the in vivo ribosome profiling 
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data set of Fig. 4c, d. The first nucleotide of the first codon of the motif is 
located at the 0-position. RPF density shift plots at proline and cysteine 
codons are shown in the middle and right panels, respectively. For the 
PPG and PPP motifs the cutoff of RPFs per window was lowered to 

10 (from the usual 50) in order to compensate for the fewer windows with 
sufficient coverage. n= 6. Error bars represent s.d.; n refers to the number 
of xenografts per condition. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Enriched i ita vaiiza 
Gln 


Other | 


sources 
PYCR1 


| 


Proline 


tRNA-Pro / ~ 


Bio-energy / 
Celullar 
building blocks 


Profile of 
ribosomes 


Other i 


sources 
PYCR1 


| 


Proline 
tRNA-Pro s 


3 \ 
Bio-energy / 


Celullar 
building blocks 


Profile of 
ribosomes 


Tumor expansion 


Extended Data Figure 10 | A schematic diagram modelling our findings. 
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Epithelial tricellular junctions act as interphase cell 
shape sensors to orient mitosis 


Floris Bosveld!, Olga Markova!, Boris Guirao!, Charlotte Martin'+, Zhimin Wang!, Anaélle Pierre”, Maria Balakireva', 
Isabelle Gaugue!, Anna Ainslie!}, Nicolas Christophorou!+, David K. Lubensky!*, Nicolas Minc? & Yohanns Bellaiche! 


The orientation of cell division along the long axis of the interphase 
cell—the century-old Hertwig’s rule—has profound roles in tissue 
proliferation, morphogenesis, architecture and mechanics’. In 
epithelial tissues, the shape of the interphase cell is influenced by 
cell adhesion, mechanical stress, neighbour topology, and planar 
polarity pathways*"!*. At mitosis, epithelial cells usually adopt a 
rounded shape to ensure faithful chromosome segregation and to 
promote morphogenesis!. The mechanisms underlying interphase 
cell shape sensing in tissues are therefore unknown. Here we show 
that in Drosophila epithelia, tricellular junctions (TCJs) localize 
force generators, pulling on astral microtubules and orienting cell 
division via the Dynein-associated protein Mud independently of 
the classical Pins/Go; pathway. Moreover, as cells round up during 
mitosis, TCJs serve as spatial landmarks, encoding information 
about interphase cell shape anisotropy to orient division in the 
rounded mitotic cell. Finally, experimental and simulation data 
show that shape and mechanical strain sensing by the TCJs emerge 
from a general geometric property of TC) distributions in epithelial 
tissues. Thus, in addition to their function as epithelial barrier 
structures, TCJs serve as polarity cues promoting geometry and 
mechanical sensing in epithelial tissues. 

To understand how rounded mitotic cells tend to align their spindle 
along their interphase shape long axis, we deciphered the mechanisms 
of spindle orientation in the Drosophila pupal notum epithelium. 
Within this tissue, more than ten thousand cells divide!’, and, as in 
many epithelial tissues, the division of rounded mitotic cells takes 
place in the plane of the tissue and is influenced by their interphase cell 
shape (Extended Data Fig. 1a, b). One possibility is that Pins (verte- 
brate LGN) or Go, polarization orients division as found in single cells 
in culture or during asymmetric divisions'*. However, Pins and Go; 
were homogenous around the cortex (Extended Data Fig. Ic, d). In 
contrast, the distribution of the Dynein-associated protein Mud (verte- 
brate NuMA) suggested a role in orienting the spindle according to the 
interphase cell shape. GFP-Mud (Mud tagged with green fluorescent 
protein) was localized at the spindle poles and unexpectedly was also 
enriched at TCJs where at least three cells meet (Fig. 1a, Extended 
Data Fig. le, h and Supplementary Video 1). Accordingly, in this tissue 
and other pupal or larval epithelial tissue GFP-Mud or endogenous 
Mud co-localized with Gliotactin (Gli), a septate TCJ marker!> (Fig. 1b 
and Extended Data Fig. li-o). Furthermore, we established that in G2 
phase GFP-Mud localizes at TCJs where it persists through mitosis 
(Extended Data Fig. 2). The TCJ localization of Mud was independent 
of Pins and Gq; in both interphase and mitotic cells (Fig. 1c-e and 
not shown). Accordingly, GFP-Mud lacking the Pins binding domain 
(GFP-MudAPins) localizes at TCJs (Fig. 1c). Whereas Mud loss of 
function did not affect Gli localization, loss of Gli led to a reduction 
of GFP-Mud localization at the TCJs (Fig. 1d, e and Extended Data 
Fig. 3a). Likewise, loss of function of the Discs-large (Dlg) septate 


junction protein, which is necessary for Gli localization’ caused the 
disappearance of both Gli and GFP-Mud from the TCJs (Fig. 1d, e and 
Extended Data Fig. 3b-d). Collectively, our results show that inde- 
pendently of the Pins/Ga; pathway, epithelial mitotic cells harbour a 
cortical TCJ Mud distribution inherited from interphase. 

Since astral microtubules contacted ChFP-Mud (Mud tagged with 
cherry fluorescent protein) patches at TCJ (Extended Data Fig. 4a 
and Supplementary Video 2), we asked whether TCJs recruit or acti- 
vate force generators to orient the spindle. Following experiments in 
Caenorhabditis elegans zygotes'®, we developed a laser ablation assay 
to estimate the relative magnitude and the direction of mechanical 
forces exerted by astral microtubules on the centrosome within tissue 
(Extended Data Figs 4b and 5). Astral microtubule ablation in wild- 
type cells caused the centrosomes to recoil away from the ablation 
site, suggesting that microtubules predominantly exert pulling forces 
on spindle poles (Fig. 2a, b and Supplementary Video 3). The loss of 
Mud or Dynein minus-end-directed motor activity led to a reduction 
in centrosome recoil upon microtubule ablation (Fig. 2b). In agree- 
ment with the role of Gli and Dlg in promoting TC] Mud localization, 
centrosome recoil velocities upon microtubule ablation were also 
reduced in Gli and dig mutant cells (Fig. 2b). Together, these results 
indicate that TCJs control the pulling forces exerted by astral micro- 
tubules on the spindle via Mud and Dynein activities. 

We then investigated whether the Mud distribution at TCJs accounts 
for the torque exerted by microtubules on the spindle to dictate its 
orientation. To this end, we adapted a mechanical model predicting the 
spindle orientation according to cell shape!”'®. In this model, devel- 
oped to describe isolated and non-epithelial cells which do not round 
up at mitosis, the pulling forces exerted by astral microtubules scale 
with microtubule length and, as a consequence, the model predicts the 
preferred spindle orientation along the long axis of the cell (Fig. 2c). 
To account for the contribution of Mud to microtubule pulling forces 
in epithelia, we modified the model to assume that astral microtu- 
bules instead pull with a force proportional to the cortical GFP-Mud 
intensity and independent of microtubule length (Fig. 2d). We then 
measured the metaphase distribution of cortical GFP-Mud and cell 
shapes to compare the predictions based on GFP-Mud intensity and 
cell shape models for cells in metaphase (Fig. 2e). Notably, the model 
based on GFP—Mud distribution along the cortex predicted spindle 
orientation and its predictions were significantly better than the ones 
based on the metaphase cell shape (Fig. 2f-h, Extended Data Fig. 6a-i 
and Supplementary Table 1). In agreement with the fact that Pins does 
not regulate Mud localization at TCJs, spindle orientation predictions 
were similar in wild-type and pins mutant tissues (Extended Data 
Fig. 7). To test the contribution of Mud-dependent microtubule pull- 
ing forces to spindle orientation further, we characterized a GFP-Mud 
mutant deleted of its coiled-coil domain (GFP-MudACC, Extended 
Data Fig. 8). GFP-MudACC co-localizes with Gli in wild-type or mud 
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Figure 1 | Mud localizes at TCJs. a, GFP-Mud 
from interphase to telophase (t=0 min, 
anaphase). GFP—Mud at TCJs (arrows), spindle 
poles (arrowheads). n= 21 cells. b, GFP-Mud 
and Gli co-localization in interphase (top, n = 54 
cells) and metaphase (bottom, n =8 cells). 

c, GFP-Mud localization in mud (n= 15), pins 
(n= 22), Ga; (n=5) cells and GFP-MudAPins 
in mud cells (n= 18). d, e, GFP—Mud distribution 
(d, images representative of quantifications 
shown in e) and TC intensities (e) in wild-type 
(WT), Gli, dig and pins cells (mean +s.e.m.). 
Fas3, cell contours. Student's t-test; NS, not 
significant; ***P < 0.0005. Scale bars, 

lym (a-d). 


tissues, but GFP-MudACC cannot restore astral pulling forces in mud _ of microtubule pulling forces, specifying the spindle orientation in 


tissue (Fig. 2i-k). Whereas the GFP-MudACC cortical localization _ epithelial tissues. 


predicted spindle orientation in wild-type tissue (Extended Data Our finding that in metaphase the Mud distribution at TCJs is a 
Fig. 8d), planar mitotic spindles were not oriented according to the _ better predictor of spindle orientation than is cell shape argues 
distribution of GFP-MudACC in mud tissue (Fig. 21). Collectively, against a model where incomplete cell rounding ensures interphase 
these findings indicate that TCJs via Mud define the distribution cell shape ‘memorization. We therefore hypothesized that the 
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Figure 2 | TCJs regulate Mud-dependent 
microtubule pulling forces to orient divisions. a, 
Ablation of astral microtubules (red line), n= 21 
cells quantified in b. b, Mean centrosome velocity 
relative to microtubule ablation site (left), mean 
velocity amplitude after ablation (mean +s.e.m., 
right) in wild-type, mud, dlg and Gli cells at 25°C 
and in wild-type and ghPN cells at 29°C. Student's 
t-test; *P < 0.05. Orientations in mud, dlg and gIPN 
differ from wild type (Watson's U? test; P< 0.01). 
c, d, Cell shape (c) and Mud intensity (d) models: 
pulling forces scale with microtubule length (blue 
arrows) or Mud cortical intensity (red arrows) to 
exert a torque (TT, arrows). e-g, Experimental 
spindle orientation (green cross) and predictions 
based on cell shape (blue circles, f) or GFP-Mud 
intensity (red circles, g) potentials at t= —1 min for 
cell in e (n= 121 cells). AU, arbitrary units; 

WT, wild type. h, Difference between theoretically 
predicted (@:neory) (blue, shape; red, GFP-Mud 
intensity) and experimental (Ojivision) spindle 
orientation. Data are duplicated in a lighter 

colour relative to 0° line in this and subsequent 
plots. Kolmogorov-Smirnov test (P value). 

i, Localizations of GFP-Mud in wild-type (n = 54) 
and mud (n= 15) cells as well as GFP-MudACC in 
wild-type (n= 18) and mud (n= 67) cells. j, 
Quantifications (mean + s.e.m.) of GEP-Mud or 
GFP-MudACC co-localization with Gli in 
wild-type and mud cells. Student's t-test; NS, not 
significant. k, Mean centrosome velocity relative to 
microtubule ablation (left), mean velocity 
amplitude after ablation (mean +s.e.m., right) in 
wild-type and in mud tissues expressing GFP-Mud 
or GFP-~MudACC. Student's t-test; **P < 0.005. 
Orientation in mud, GFP-MudACC differs from 
wild type (Watson's U? test, P< 0.001). 1, Difference 
between Otheory (from cortical GFP intensity) and 
Oaivision in mud cells expressing GFP-MudACC or 
GFP-Mud. GFP-Mud in mud and wild-type 

(h, red) tissue are similar (P= 0.12). Kolmogorov— 
Smirnov test (P values). Scale bars, 1 1m (a, e, i). 
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interphase TC) distribution might account for the Hertwig rule in 
tissues. Since our theoretical analysis indicated that spindle orienta- 
tion is mainly dictated by the anisotropy of the TCJ distribution 
(Extended Data Fig. 6h, i), we introduced a TC) bipolarity quantity 
characterized by an anisotropy (Nrcy) and orientation (@yc;) to 
describe the TC) angular distribution in a given cell (Fig. 3a and 
Extended Data Fig. 6j). The TC) bipolarity anisotropy and orienta- 
tion can be similar to or distinct from the cell shape anisotropy (or 
elongation, 7,,,.) and long-axis orientation (nape 3 Fig. 3a). We 
found that the anisotropy of TCJ bipolarity decreases much less than 
cell shape anisotropy during mitotic cell rounding (Fig. 3b). Also, 
division orientation predictions based on the TC) distribution, 
unlike those based on cell shape, were similar in interphase and 
mitosis (Fig. 3c). These findings support the notion that TCJ bipo- 
larity is a persistent marker of the interphase cell elongation axis 
during mitotic rounding. We then measured each cell’s average shape 
(shape ) and TC] bipolarity (O-rcy) from 60 to 30 minutes before mito- 
sis (from late G2 interphase to before mitotic rounding) as well as its 
division orientation (Ogivision). Apart from cases where TCJ and shape 
orientation are aligned ({@-rcy — Ohape | < 10°), TCJ gives better divi- 
sion orientation predictions than cell shape does, and this improve- 
ment increases as the difference between shape and TC orientation 
increases (Fig. 3d, e, gand Extended Data Fig. 9a). This finding applies 
for both rounded cells (low Nehape) and elongated cells (high 
Wecoe? (Extended Data Fig. 9b, c) and thus raises the question of 
why cells tend to divide according to their interphase cell long axis. 
The distribution of the angular difference between TC] and shape ori- 
entation is broad in rounded cells (low 7),,,,,,.) but narrow in elongated 
cells (high ),,,,.) (Fig. 3h). Accordingly, cell shape does not predict 
the cell division axis in rounded cells, and as cell shape anisotropy 
increases, the predictions based on cell shape agree more and more 
with the predictions based on TC) bipolarity (Fig. 3f, g and Extended 
Data Fig. 9b, c). Hence, in rounded cells TCJ bipolarity and cell shape 
orientations may be misaligned and division orientation follows TC] 
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Figure 3 | TC) distribution accounts for division 
orientation along the interphase cell shape. 
a, TCJ (red dots) bipolarity and cell shape (blue) 
anisotropies 7) and orientations 0 represented 
by the length and orientation of red and blue bars. 
b, Ney ANA 7),,ap¢ {rom interphase to anaphase 
(mean + s.e.m.}. Insets, time-lapse images of a cell 
from interphase to mitotic rounding (n= 249 cells). 
| Prey ~Pavsion| Student’s t-test; *P < 0.05, ***P < 0.0005. ¢, 
@ Metaphase n = 251 Difference between experimental (division) and 
predicted division orientations by the average 
(—20 to —10 min interphase, —4 to —3 min 
metaphase) cell long axis (Ashape) (left) or TCJ 
. bipolarity @7c) (right). Kolmogorov-Smirnov test 
wy, (P values). d, Top, O.hape and Orcy align with division: 
80° Bottom, only @7q aligns with Agivision. Time-lapse 
357 60° images of 2 cells out of the 29,388 cells analysed. 
P= 0.014 
O5< Nshape < 1.0 


e, f, Difference between experimental (division) and 
predicted division orientations based on interphase 
TC) bipolarity (9-rcy) or cell long axis (Oshape) for 
|Orcy — Oshape| intervals (e) and indicated n,.. 
intervals (f). Kolmogorov-Smirnov test (P values), 
percentage of 29,388 cells. g, TC] bipolarity (@rcy) 
prediction improvement over cell long axis (shape) 
versus |97cj — Oshape|- Mean + s.e.m. of three 

movies for a total of n = 29,388 cells analysed. 

h, Differences (green bars, mean + s.d.) between 
Oshape AN Orcy VEFSUS 175, ,,,.- 45505 cells (grey dots) 
randomly picked from the 29,388 cells analysed. 
Correlation coefficient A experiment = 0.88. 

i, Difference between experimental (O4ivision) and 
predicted division orientations based on interphase 
cell long axis (O.hape) (left) or TC] bipolarity (@r¢y) 
(right) in wild-type and mud cells. Kolmogorov- 
Smirnov test (Pvalues). Scale bars, 1 jm (a, d). 
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bipolarity, whereas in elongated cells TC) bipolarity and cell shape 
orientations are aligned in most cases, and the TC) distribution 
ensures that cell division occurs along the former interphase cell long 
axis. Lastly, cell division orientation along the interphase cell long 
axis and TC distribution was strongly reduced in mud mutant tissue 
(Fig. 3i and Extended Data Fig. 5f, g). Altogether, we propose that 
TCJs, via Mud, constitute the dominant mechanism of division 
orientation along the interphase cell long axis. 

Why are the orientations of cell long axes and TCJ distributions 
aligned? This can be understood by picturing regular hexagonal cells, 
which are then pulled. The cell elongation leads to the alignments 
of cell shape and TCJ bipolarity orientations with the pulling direc- 
tion (Fig. 4a). Computer simulation can then be used to model the 
disordered case of epithelial cells whose shapes depend on adhesion 
and cortical tension!*. The simulations reproduce the alignment 
between cell shape long axis and TC) bipolarity orientations as cell 
shape anisotropy increases, as well as the average alignment of the 
TC) bipolarity and mechanical strain orientation (Fig. 4b, c, Extended 
Data Fig. 10d and Supplementary Video 4). Therefore, generic prop- 
erties of epithelial cells, adhesion and cortical tension, are sufficient 
to reproduce the alignment of TC] bipolarity and cell shape as their 
anisotropy increases. Furthermore, in agreement with the fact that 
global mechanical stress tends to elongate cells*'’, the alignment of 
TC) bipolarity with mechanical stress increases as tissue stress ani- 
sotropy increases, thus accounting for orientation of divisions along 
the global mechanical stress direction (Fig. 4d and Extended Data 
Fig. 10c). Our findings indicate that the alignment of TC) distribution 
with cell elongation and mechanical stress axis is a core geometric 
property of epithelial tissues and accounts for a role of TC]Js as spatial 
landmarks that provide the information needed for cell shape and 
mechanical strain orientation sensing. 

Altogether our findings provide evidence that TCJs can serve as 
built-in interphase shape sensors to orient division when the inter- 
phase cell shape is well defined. This mechanism is distinct from others 
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Figure 4 | TCJ alignment with shape is a core geometric property. 

a, b, Regular hexagonal cells (a) and simulated cell lattice (b) before and 
after elongation. c, Differences (mean + s.d.) between Ohape and Orcy Versus 
Nohape for 4,504 simulated cells (grey dots) and 4,505 experimental cells 

(Fig. 3h). Asimulation = 0.94. d, Difference between @ 7c, and principal strain 
axis (strain) Versus normal stress differences. AU, arbitrary units; n, ablation 
number. e, Upon mitotic rounding, Mud interphase localization is 
maintained at TCJs orienting the spindle along the interphase cell long axis. 


involving extracellular matrix retraction fibres or signalling!”?°-’. It 


accounts for the integration of two properties of epithelial division: 
orientation along the interphase cell shape and mitotic rounding 
(Fig. 4e). The packing of tissue promotes contacts between three (or 
more) cells and the formation of TCJs. TCJs are implicated in epi- 
thelial barrier function”*”® and are the sites of enrichment of several 
proteins including adhesion molecules, cytoskeleton regulators and 
Hippo pathway components****. The alignment of TCJs with cell 
shape or mechanical strain being a geometrical property of epithelia, 
TCJs might therefore provide epithelial cells with an axial polarity (or 
bipolarity) to couple cell shape and tissue mechanics with adhesion, 
cytoskeleton organization and signalling. 
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METHODS 


Fly stocks and genetics. Drosophila melanogaster stocks and associated references 
are listed in Supplementary Table 2. Flies were crossed and experiments were 
performed at 25°C unless specified otherwise. Male or female pupae were used. 
Experiments using the temperature-sensitive allele of p150/Glued, D82glued* were 
performed as previously described for temperature-controlled experiments in the 
Drosophila pupa’. Loss-of-function, gain-of-function and dual-coloured-patches 
experiments were carried out using the FLP/FRT or the MARCM techniques*!**. 
Somatic clones were induced in the second instar larval stage by heat shock (20 min 
at 37°C for FRT19A and FRT40A, 1h at 37°C for FRT42D) and analysed 3-4 days 
after clone induction in 12-20h after pupa formation (hAPF) pupae. The analy- 
ses of dig? loss-of-function clones were performed in small clones (3 days after 
induction) to avoid apical—basal polarity defects**. 

Molecular biology. To create the GFP-Mud or ChFP-Mud transgenes under the 
control of Mud endogenous promoter, we used recombineering*”” to introduce 
a GFP or ChFP tag at the ATG of the mud open reading frame in the CH322- 
147E14 BAC genomic clone (BACPAC Resources Center). This BAC clone contains 
~19.5kb of X chromosome genomic region including ~3.3 kb upstream of the 
mud mRNA 5’ and 5.4kb downstream of the mud 3’ mRNA and thus most of the 
coding regions of genes proximal and distal to mud locus. 

First a galK cassette, amplified with primers F (5’-CATACATATACGGG 
CGCACACACACCCATAAAAACGCACAAAAATTCGCACCTGTTGACAA 
TTAATCATCGGCA-3’) and R (5‘-GATTTACATACCCACTGGAGTA 
GGACCTTGCGCCAGCTGCGCGTGTCCATTCAGCACTGTCCTGCTCCTT- 
3’) (underlined bases indicate galK sequences), was inserted via recom- 
bination at the N terminus of the mud open reading frame. After positive 
selection, the galK cassette was replaced with a GFP tag, primers F 
(5'’-CATACATATACGGGCGCACACACACCCATAAAAACGCACAAAAATT 
CGCAATGGTGAGCAA GGGCGA GGA-3’) and R (5’-GATTTACATACCCAC 
TGGAGTAGGACCTTGCGCCAGCTGCGCGTGTCCATCTTGTACAGCTCG 
TC CATGC-3’) (underscored letters for GFP sequences) or with a ChFP tag, 
primers F (5’-CATACATATACGGGCGCACACACACCCATAAAAACGCA 
CAAAAATTCGCAATGGTG AGCAAGGGCGAGGAGGATAACATG-3’) 
and R (5/-GATTT ACATACCCACTGGAGTAGGACCTTGCGCCAGCTG 
CGCG TGTCCATCTTGTACAGCTCGTCCATGCCGCCGGTGGA-3’) 
(underscored letters for ChFP sequences) via recombination and negative selec- 
tion for galK*’. The attB-P[acman-GFP-Mud]-CmR-BW was integrated into the 
PBac{y[+]-attP-9A}VK00030 landing site at 50E1, PBac{y[+]-attP-9A}VK00031 
landing site at 62E1 and PBac{y[+]-attP-9A}VK00033 landing site at 65B2. The 
attB-P[acman-ChFP-Mud]-CmR-BW was integrated into the PBac{y[+]-attP-9A} 
VK00030 landing site at 50E1, PBac{y[+]-attP-9A}VK00031. 

Deletions within the GFP-Mud genomic region were created by recombi- 
neering using a neomycin resistance cassette flanked by loxP sites**, which were 
amplified by PCR (see below), at the amino acid positions shown in Extended 
Data Fig. 8a. Upon neomycin selection, the cassette was removed by Cre- 
mediated recombination** leaving behind a 78 bp loxP site sequence. The follow- 
ing primers were used for PCR: MudACH: F (5’-CATACATATACGGGCGCA 
CACACACCCATAAAAACGCACAAAAATTCGCAGGCCTGGTGATG ATGG 
CGGGATC-3’) and R (5‘-CTGCTGGGAAGACATGGGCTGACTGAGGTC 
GAAACCCCTTGTCG GTAAACTCAGAAGAACTCGTCAAG AAGGCG-3’, 
Note that the MudACH is not tagged with GFP); GFP-MudACC: 
F (5'-GGCTGTTGACGCGCGAATATCT TAGCCAGGCGATCGC 
CAACGTTGCAGTTCGTTCCTTGTATACGGCGGAGGTGACGCGCATGAAG 
GAGAAGCAGGAACG-3’) and R (5'-CGTTCCTGCTTCTCCTTCATG 
CGCGTCACCTCCGCCGTATACAAGGAACGAACTGCAACGTTGGCGATC 
GC CTGGCTAAGATATTCGCGCGTCAACAGCC-3’); GFP-MudA Pins: 
F (5'-CCGTTTCGTCCAGTTCGTCGGCGCCGAA CGATGACTG 
GCAGCCCTTCAAGCGCCACTCCGGCTCCCAGATAAC-3’) and R 
(5'-CTTACTTTGAGATCTTCGTCCTGGCTGCCCAAATCATATTGGGCAGC 
ATAACT AGTGGATCCCCT CGAGGGACC TAATAAC-3’) and GFP- 
MudATM: F (5/-AATTCACACAACTGGTGGCCGCCTCTTGCAGTAATAT 
CACTACGACTAGCTAGAAGCGGCAACGAAAGCAATGGGAAACGCACAA 
ATCTTGCTGATGATC-3’) and R (5‘-GATCATCAGCAAGATTTGTGC 
GTTTCCCATTGCTTTCGTTGCCGCTTCTAGCTAGTCGTAGTGATAT TAC 
TGCAAGAGGCGGCCACCAGTTGTGTGAATT-3’). 

The MudACH, GFP-MudAPins, GFP-MudACC and GFP-MudATM BAC 
constructs were integrated at the (PBac{y[+]-attP-9A}VK00030 landing site at 
50E]1 and at the PBac{y[+]-attP-9A}VK00033 landing site at 65B2. 

A deletion of the C-terminal domain of Mud including the Pins and micro- 
tubule binding domains (MudAC) (see Extended Data Fig. 8a) was created 
using the CRISPR/Cas9 system*? at the endogenous mud locus. Two sgRNAs 
(upstream targeting sequence; 5‘/-CATCCAGTCTA ACCAGGCGGAGG-3’ and 
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downstream targeting sequence: 5‘-AGATGAGGCGCCGGTCATGTTGG-3’) 
were inserted into pU6B-sgRNA-short”’ and co-injected with purified ssODN 
5'-GGCTGCTTCTCGCTTCCAACCAAGAGTTGGAAGAACTAAATTCCAT 
CCAGTCTAACCAGGT-A-GACCGGCGCCTCATCTTGTACAGTCTATTCGA 
TCGGCAGTG TGCACATGCAGC CGCTGC-3’ (A denotes position of the 
deletion) in the vas-Cas9 line*!. Resulting F1 flies were screened for germline 
transmission of the deletion by single fly PCR. The F2 progeny was sequenced to 
confirm the deletion of the C terminus domain. 

Immunohistochemistry and fixed tissue imaging. Pupae were dissected and 
fixed as previously described”. Primary antibodies were: rabbit anti-Go, (1:500, 
provided by J. A. Knoblich), rabbit anti- Mud (aa375-549) (1:1,000 (ref. 43), mouse 
anti-Gli (1:250 (ref. 44), mouse anti-Dlg (1:1,000, DSHB, 4F3), mouse anti-FasIII 
(1:50, DSHB, 7G10), guinea-pig anti-Cora (1:2,000 (ref. 45), rabbit anti-GFP 
(1:2,000, Molecular Probes). Fluorescent secondary antibodies were: Alexa-488 
goat-anti-rabbit IgG (1:500, Molecular Probes), Cy2, Cy3 and Cy5 donkey- 
anti-mouse IgGs (1:500, Jackson ImmunoResearch). Images were collected with 
confocal microscopes (LSM710NLO or LSM780, Carl Zeiss). All images are 
maximum projections of a z-stack unless otherwise indicated. 

Live imaging. Pupae were prepared for live imaging as described previously”*. 
Samples were imaged at 25°C or 29°C with either an inverted confocal spin- 
ning disk microscope from Nikon or Zeiss using either 40x NA1.3 OIL DIC 
H/N2 WD0.2 PL FLUOR, 60x NA1.4 OIL DIC N2 PL APO VC, 63x NA1.4 OIL 
DICII PL APO or 100 NA1.4 OIL DIC N2 PL APO VC objectives and either a 
CoolSNAP HQ2 (Photometrics), an EMCCD Evolve (Photometrics) or a CMOS 
(Hamamtsu) camera. Live images of FUCCI and GFP-Mud were acquired using 
a confocal microscope (LSM780, Carl Zeiss) and 63 x NA1.4 OIL DICII PL APO 
objective. To improve signal-to-noise ratio, videos and images for display and seg- 
mentation were either deconvolved using Huygens software (Scientific Volume 
Imaging) or denoised using Safir software*’. Average projections of raw z-stack 
images were used for intensity measurements. 

Unless specified otherwise, all experiments on dividing cells were performed 
during the first round of cell divisions in the notum tissue (12-20 hAPF). In the 
analyses (unless mentioned otherwise), the time (t) equals 0 was set at anaphase 
onset defined by the initial cell elongation and/or centrosome movements towards 
the cortex. 

Photobleaching of GFP-Mud or GFP-MudACC in neighbouring cells. Since 
the cortical GFP-Mud or GFP-MudACC signals at the TCJ has contributions 
from both the dividing cell and its neighbouring cells, we performed prediction 
of spindle orientation based on GFP-Mud or GFP-MudACC intensity upon pho- 
tobleaching of GFP-Mud or GFP-MudACC in the neighbouring cells, the residual 
GFP-Mud or GFP-MudACC signal reflecting more faithfully the distribution of 
GFP-Mud or GFP-MudACC at the TC] in the dividing cells. Having found by flu- 
orescence recovery after photobleaching (FRAP) that GFP-Mud turnover at TCJs 
is on the order of tens of seconds (t),,=21+7s, n= 28, data not shown), a region of 
approximately two cell diameters was photobleached (491 nm laser at 100% power, 
12 iterations) around a dividing cell, which was identified by the accumulation of 
GFP-Mud or GFP-MudACC at the spindle poles. Following photobleaching of 
GFP-Mud or GFP-MudACC in neighbouring cells, confocal z-stacks of 14 slices 
(0.5 um per slice) were acquired every 1 min. Cells were used for predictions of 
mitotic spindle orientation when their anaphase onset (t= 0) occurred at least 
4min after photobleaching ensuring that the GFP-Mud or GEP-MudACC signal 
are mainly contributed by the dividing cells. 

Segmentation and tracking of cells during tissue development. To record cell 
division orientation, cell shape and TC) bipolarity during development, maximal 
projected images of multiscale time-lapse videos of pupa notum tissue labelled 
with either E-Cad-GFP (16-28 z-stacks 0.5}1m per slice, 0.322 |1m per pixel, 5 min 
per acquisition, labelling of the apical adherens junctions) or Dlg—GFP (20 z-stacks 
0.5 41m per slice, 0.205 1m per pixel, 3 min per acquisition, labelling of the septate 
junctions) were segmented and cell divisions were tracked as in refs 13,48. 
GFP-Mud kymograph from interphase to mitosis. To generate the kymograph 
of the GFP-Mud signal around the cell contour from interphase to anaphase, the 
cell contour was manually segmented using the GFP-Mud signal. The GFP-Mud 
intensity of the fluorescence signal was recorded as a function of the angle @ of 
each contour pixel around the centre of mass and then plotted as a line for each 
time point. 

Laser ablations of astral microtubules and estimation of cortical forces. Laser 
ablation of astral microtubules were performed in cells labelled with Jupiter-GFP 
or aTub-GFP (microtubule markers) and Sas-4—RFP, Spd-2-REFP or YFP-Asl (cen- 
trosome markers) as well as Dlg-GFP or Nrg-GFP (septate junction markers). 
Images were captured using a two-photon laser-scanning microscope (LSM710 
NLO, Carl Zeiss) equipped with a 63 x NA1.4 OIL DICII PL APO objective (digital 
zoom 3 x) in single-photon bidirectional scan mode lasting 6t=756 ms. The astral 
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microtubules of mitotic spindles, which were parallel to the plane of the epithe- 
lial tissue were severed (t3) using the Ti:Sapphire laser (Mai Tai DeepSee, Spectra 
Physics) at 890 nm with <100 fs pulses with a 80 MHz repetition rate typically set 
at 60% power. 

To measure the recoil velocity (amplitude and orientation) of the centrosomes 

upon astral microtubule ablation, all spindles were registered horizontally with 
the ablated astral microtubules at the top right and the centrosome positioned 
at the origin. Centrosome movements were then manually tracked. The velocity 
(amplitude and orientation) was measured between f) and fy9. 
Image quantifications and GFP-Mud and GFP-MudACC localization 
analyses. To measure and compare the TCJ accumulation of GFP-Mud in wild- 
type, pins, Gli and dig cells (Fig. le), confocal z-stack average projections at the 
level of the septate junction (labelled by FasIII, Cora, PH-ChFP or mRFP) were 
generated using Image] from fixed (pins and dlg) or live (wild-type and Gli) 
tissues harbouring wild-type, Gli, dig and pins mutant cells. Using FasIII, Cora, 
PH-ChFP or mRFP labelling, the positions of the TCJs in each cell were manually 
determined. The FasIII, Cora, PH-ChFP or mRFP labelling were used to draw a 
mask (5 pixels wide) delineating the cell outline at the level of the mitotic spindle 
(as determined by GFP-Mud localization at the spindle poles) of (pro)metaphase 
cells. Following background subtraction, the mean intensity at TCJs (10° over each 
TCJ) was divided by the mean intensity along the rest of the cell outline to obtain 
the accumulation at TCJs. Significance was determined using Student’s t-test. 

To compare the GFP-Mud or GFP-MudACC distributions at TC] in wild- 
type, pins and mud tissues (Fig. 2j or Extended Data Fig. 7b), their co-localization 
with Gli was quantitatively compared as follows. Confocal z-stack average 
projections at the level of the septate junction of fixed (pro)metaphase cells (as 
determined by DAPI staining) expressing either GFP-Mud or GFP-MudACC 
and labelled with Gli and Coracle (Cora, a septate junction marker) were gener- 
ated using Image]. Using the Cora staining, a mask 5 pixels wide was manually 
drawn to measure the raw fluorescent intensity profiles of GFP-Mud or GFP- 
MudACC and Gli in (pro)metaphase cells (as determined by DAPI staining) 
which were normalized by their total fluorescence intensity upon subtraction of 
the background intensity. The co-localization factor (C, vertical axis Fig. 2) and 
Extended Data Fig. 7b) between GFP-Mud or GFP-MudACC and Gli was then 
determined by calculating the area between the GFP-Mud or GFP-MudACC 
and Gli normalized intensity curves. 
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C equals 1 if the two proteins perfectly co-localize and equals 0 if the two proteins 
do not co-localize. The comparisons of the distribution of GFP-Mud, GFP- 
MudACC in wild-type and mud tissues were performed blind (Fig. 2)). Significance 
was determined using Student’s t-test. 

Measurement of apical-basal (AB) angle of the spindle aay. The AB orientations 
of the mitotic spindle (aap) in the different experimental conditions were deter- 
mined by measuring the orientation of the centrosomes (marked by Spd-2-RFP, 
Sas-4—RFP or Sas-4-GFP) relative to the plane of the epithelial tissue (labelled by 
aTub-GFP, Jupiter-GFP or RFP-aTub) using a custom Image] plugin. Statistical 
significance was assessed using the Kolmogorov—Smirnoyv test. 

Analyses of cell rounding and mitotic spindle prediction from interphase cell 
shape and TCJ bipolarity. The analyses of cell rounding from t= —60 min to 
cytokinesis was performed using Dlg-GFP. The mitotic spindle being positioned 
at the level of the septate junction (Extended Data Fig. 1g-i), the segmentation of 
Dlg-GFP cortical signal recapitulates cell shape changes at the level of the spin- 
dle and the distributions of TCJ where GFP-Mud is enriched in interphase and 
mitosis. Owing to the spreading of the Dlg—GFP cortical signal along the lateral 
domain of the cell, the accurate segmentation of the cortical Dlg—GFP signal was 
achieved by manual correction of each individual cell. This can only be performed 
on a limited number of cells during their cell cycle (n= 249 cells from 2 distinct 
videos, 3 min temporal resolution). 

In order to compare the prediction based on cell shape long axis versus TCJ 
bipolarity, a very large number of segmented cells are needed. We therefore used 
E-Cad-GFP time-lapse videos (5 min temporal resolution) since the segmentation 
of the E-Cad-GFP signal can be readily automated and accurate segmentation 
of cell shapes and TCJ positions can be achieved for a very large number of cells. 
Quantifications shown in Fig. 3e-g and Extended Data Fig. 9 were obtained from 
29,388 cells analysed from 3 distinct videos. 

To compare the prediction based on cell shape long axis versus TC] bipolarity 
in wild-type (6 videos, n = 4,860 cells) and mud (8 videos, n=7,770 cells) mutant 
tissue, predictions were performed in a specific region of the tissue were spindle 
misorientation along the AB is very weak (Extended Data Fig. 5f, g). 


Upon Dlg-GFP signal or E-Cad-GFP signal segmentation and cell tracking, 
the following measurements were determined using Matlab. 

(i) The experimental cell division orientation (Msivision) Was determined as the 
orthogonal of the interface between the two daughter cells upon cytokinesis. 
Oaivisions Which correlates very well with cell division orientation measured by the 
positions of the two centrosomes at metaphase (R= 0.91, n= 127 cells, data not 
shown) as established using time-lapse videos of E-Cad—GFP- and Spd-2-mRFP 
(centrosome marker)-labelled epithelial tissue. 

(ii) To characterize cell shape elongation (n,, am, and cell shape long axis orien- 
tation (shape), each cell region was used to construct its inertia matrix. 
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where pix is the number of pixels in the cell and % = (xp, y,)are the vectors point- 
ing from the barycentre of the cell to each pixel of the cell, p. Its eigenvalues As, As, 
with 0 < As < As, have the dimensions of a squared distance. Its eigenvector asso- 
ciated to As defines the direction of the cell's long-axis @.hape). The cell shape 
anisotropy was defined as a dimensionless number, which ranged from 0 for a cell 
perfectly circular, to 1 for an infinitely stretched cell:7).pape = 1 — As / As. 

(iii) To characterize the anisotropy (nyc) and orientation of the TCJ (Orc) 
angular distribution, we built the “TCJ bipolarity’ matrix V: 
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where nrcy is the number of TCJs in the cell and the “H, are the unit vectors point- 
ing from the barycentre of the cell to each cell TC, v (that is, w= /\Rl) 
(Extended Data Fig. 6j). Its eigenvalues Apc, Arcy, with 0 < Atcy < Arcp are dimen- 
sionless numbers. Its eigenvector associated to Aycy defines the direction of the 
long axis of the TCJ bipolarity (@tcy). The TC) distribution anisotropy was defined 
as a dimensionless number, ranging from 0 for TCJ uniformly distributed around 
the cell, to 1 for the theoretical case of TC] split in two groups diametrically 
opposed: trey = 1 — Argy / Arcy- 

Note that unlike the cell inertia S that is calculated using all the pixels making 
up the cell, the TCJ bipolarity V solely uses the unit vectors #, pointing from the 
cell centre to each cell TCJ. By doing so, the TCJ bipolarity disentangles the char- 
acterization of the TC] distribution from cell shape measurement, and any corre- 
lation observed between the two quantities is not due a shape bias in the TCJ 
bipolarity measurement. In the example shown in Extended Data Fig. 6j, although 
the two cells have different shape anisotropies, they share the same set of u, 
vectors and have therefore the same TC] bipolarity. 

Both cell shape anisotropy and TCJ anisotropy were normalized to their respec- 
tive averages over all the cells in the tissue. The cell shape and TC) distribution 
anisotropies are represented with bars whose directions give the direction of their 
respective anisotropies and whose length is proportional to the magnitude of the 
normalized anisotropy. 

To compare the orientations of the cell shape long axis or the TC] bipolarity axis 
with the cell division orientation, the cell shape and TCJ distribution tensors S and 
V were averaged during late interphase from 60 to 30 min before the end of cytoki- 
nesis. The orientations Dstape and Orcy of the resulting averaged tensors were then 
compared to the experimental cell division orientation, Muivison- 

The improvement of spindle orientation prediction is calculated as 

Aw = 1) x 100. Ng is the number of cells for which | A4ivison — Dgrape| <15°, 
and Nr is the number of cells for which | ivision — Orc5| < 15°. The mean 
improvement and its standard deviation were calculated using the improvement 
values from three different videos. Similar improvement values are found when 


determining Ns and Nyy for | Odivision — Oshapeland | Oaivision — Orc [below 5°, 10°, 20° 
or 25°. 


The correlation coefficient A between TC] bipolarity and cell long axis orien- 
tation is calculated as 


_ 258 meos?(A@) 
Sn 


where N is the total number of cells analysed. For each cell, A@; equals Orcy — Ohape 
and 7; is the cell shape anisotropy. The correlation coefficient A ranged from —1 
for complete anti-correlation to 1 for complete correlation. In between, a homo- 
geneous distribution indicating an absence of correlation led to A=0. The corre- 
lation coefficient was calculated over all 4,504 simulated cells or over an equal 
number of experimental cells randomly picked (n= 4,505) among the 29,388 cells 
analysed in Fig. 3e-g and Extended Data Fig. 9. 
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As cell division is symmetric in size in the Drosophila notum we have focused on 

the anisotropy and the orientation of the distribution of the TCJ (bipolarity). The 
analysis of the asymmetry of TC) distribution in epithelial tissue where epithelial 
cells undergo unequal size distribution might provide insights on how unequal 
daughter cells are generated in epithelial tissue. 
Numerical simulations. We used numerical simulations based on the cellular Potts 
model, which is particularly relevant in biology to describe variable cell shape, size, 
packing and irregular fluctuating interfaces of cells**°°. We consider a 2D square 
lattice. Each pixel i has an integer index 0; The mth cell is defined as the domain 
consisting of all pixels with the same index value oj = m. The number of pixels that 
cell has defines its cell area. A cell shape changes when one of its pixels is attributed 
to another cell. Here, the evolution is driven by the minimization of a total energy 
E, which has three physical ingredients: interfacial energy, area constraints and an 
external force applied to the patch of cells. Since the calculations are performed on 
a lattice, we have 


E=AS0[1—6(0%0))]+7 > (Am — Ao)? +k SO (xi— x0) 
ij m i 


The first term represents the contribution of the energy of the interfaces between 
the cells. Minimizing this term leads to perimeter minimization (6 is the Kronecker 
symbol and A is interfacial energy). The second term keeps each cell area A,, close 
to its predefined target value Apo (y is the compressibility). The balance between 
this term and the preceding one simulates a tissue relaxing towards mechanical 
equilibrium. The third term describes an energy gradient®)”’, that is, an elastic 
force field, which pulls on the tissue in opposite directions (k is an elastic constant 
and xp is the x position of the centre of the simulated field). 

The algorithm to minimize E uses Monte Carlo sampling and the Metropolis 
algorithm, as follows. We randomly draw (without replacement) a lattice pixel and 
one of its eight neighbouring pixels. If both pixels belong to different cells, we try 
to copy the state of the neighbouring pixel to the first one. If the copying decreases 
E, we accept it, and if it increases E, we accept it with probability P= e~4#/", where 
AE = Eafter — Ebefore- The prefactor T is a fluctuation (random copying) allowance. 
Because all energy parameters are scalable with T, we can fix it without loss of 
generality; for numerical convenience, we choose numbers on the order of 10°. We 
define one Monte Carlo time step (MCS) as the number of random drawings equal 
to the number of lattice pixels. We ran simulations during 600 MCS to reach a 
tissue shape that no longer evolves (initial image). We then applied a constant bulk 
force that stretched a tissue and ran the simulations for 600 MCS (final image). To 
obtain a large range of cell anisotropies in the simulations we used simulations with 
an elastic constant, k = [0;0.5;1;2;3;4;5; 10] For each value ofk, 5 simulations 
(with about 170 cells in each simulation) were run. Using the initial and final 
images, the tissue elongation along the direction of stretching was calculated for 
each simulation as the relative increase in distances between landmarks. On the 
final image, the segmented cell contours were used to determine the tensors V (TCJ 
bipolarity) and S (inertia matrix) for each cell as well as their averages over all cells 
in the simulation. 

Mechanical stress estimation and TC) bipolarity orientation. To compare the 
mechanical stress and TCJ bipolarity orientations as a function of the normal 
mechanical stress difference (a), — ox), we used oy, — ox, experimental values of 
the estimated mechanical stress obtained up to a prefactor by Bonnet et al.>> meas- 
ured from 12hAPF to 28hAPF in the medial region of the scutellum, where the 
mechanical stress is oriented along the medial-lateral axis (oy, = 0 and @stress = 0). 
The experimental orientation of mechanical stress for each ablation was compared 
to the average TC] bipolarity orientation determined using the segmented cell 
outlines of the E-Cad-GFP cells within the rim of ablated cells (Extended Data 
Fig. 10a, b). 

Predictions of spindle orientation based on experimental Mud distribution and 
cell shape. For predictions of the mitotic spindle orientation based on GFP-Mud 
or GFP-MudACC signals (hereafter referred to as Mud fluorescence signal) in 
wild-type and mud tissues, average intensity (2-4 j1m) projections centred around 
the plane of the centrosomes were generated using a custom Image] plugin for the 
t=—2 min and t=—1 min frames (t=0 corresponding to the anaphase onset). 
A 5-pixel mask that does not overlap with the GFP-Mud or GFP-MudACC cen- 
trosome signal was drawn around the cortex to determine, using a Matlab script, 
the shape of the dividing cell as well as the cortical Mud signal profiles. The experi- 
mental orientation of the mitotic spindle and the positions of the two centrosomes 
were manually determined using the GFP~Mud or GFP~MudACC accumulation 
at the spindle poles (Fig. 2e,i). 

The model predicting spindle orientation from cell shape is similar to the one 
used in refs 8,18. The model based on Mud fluorescence signal is adapted from 
this previous model and inputs the distribution of cortical intensity of GFP-Mud 
or GFP-MudACC obtained from a fluorescence image in a mitotic cell!”"*. 
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This intensity computed around the cell contour is renormalized, so that the sum 
of intensities around each treated cell is the same. For each cell, we aimed to com- 
pute the global torque T generated as a function of the spindle orientation angle? 
(Fig. 2d, g). For each possible spindle orientation 9, (@ varying from 0 to 7) we 
generate two asters of N microtubules nucleated at a constant angular density p 
from centrosomes placed at a distance +L / 2 from the spindle centre along the 
axis 9. Both L, which represents the spindle length, and the spindle centre are 
computed from the experimental position of the two centrosomes (Fig. 2e). 
A microtubule projecting at an angle @ with respect to the spindle contacts the cortex 
at a given location with a GFP-Mud or GFP-MudACC concentration Cmua and is 
assumed to pull on the spindle pole it is connected to with a force f[Cmua (0; ¢)] 
that scales with Crud: f[Cmua (8; ¢)]~ (Cmua)®; with the exponent a representing 
putative nonlinearity in how Mud may influence astral microtubule pulling forces. 
This yields a torque, 7(9, @), at the spindle centre projected along the z axis: 


7(0, 8) = =f1Cm( 8, 8)]sin() 


The resultant total torque T(@) generated by the two asters is then obtained by 
summing the projected torques over all microtubules: 


T(0) = 
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where ® is the total angular width of the aster. Initial tests of the model showed 
that, above a certain threshold, the number of microtubules N (or equivalently the 
angular density: p = N / ®), does not materially impact axis definition (Extended 
Data Fig. 6c-f). Thus, in the model, we keep N asa silent parameter, by normaliz- 
ing the total torque with N. The stable theoretical axis orientation, Oheory can be 
identified from the minima of the potential U(@) computed as a primitive of T (0), 
and compared with the experimental division axis Aivision (Fig. 2f-h and Extended 
Data Fig. 6g). 

The quality of the prediction was computed based on the magnitude of the 

angular deviation between the model and the experiments, |Oheory — Adivision|- 
Overall, the model based on the Mud distribution accounts for observed spindle 
orientation, with a mean angular deviation magnitude of 27.7° + 11.9° (n= 140 
cells). The shape-based model applied to these same cells predicted a higher mean 
deviation magnitude of 37.6° + 12.3° (n= 140 cells). An open question is why the 
model based on Mud distribution predicts spindle orientation within only 27.7° 
For comparison, a previous model applied to dividing sea urchin eggs of various 
shapes made predictions within 15.6°. Although we cannot fully preclude the 
existence of Mud-independent secondary systems that contribute to spindle ori- 
entation, it is important to outline the structural differences in models and bio- 
logical systems that could explain these differences. One first difference is that 
the Mud model infers a fluorescence signal distribution which could be in part 
affected by the imaging itself, yielding variations in Mud signal peak heights or 
widths which do not reflect the actual force field. In agreement with this, the same 
model run with cells where the neighbours are not photobleached makes predic- 
tions within 32.4°(n = 241 cells, data not shown). Another probably more impor- 
tant difference is that in Drosophila epithelial cells the spindles move with a 
time-scale close to mitosis duration (data not shown). By contrast, in large cells 
like zygotes and blastomeres, division axes are stably set for tens of minutes with 
negligible movement and rotation of the spindles'***. Although the lack of a 
standardized Mud distribution precludes us from computing the effective tem- 
perature of the system, the difference in spindle movements suggests that the noise 
in the Drosophila epithelial system studied is much larger than in the other cell 
types previously analysed. 
Predictions of spindle orientation at the tissue scale based on TC) anisotropy. 
In order to study spindle orientation across the entire tissue, we used the bipolar- 
ity axis of the TC) distribution in each cell as a proxy for a prediction based on the 
full Mud protein distribution (see Fig. 3 and the discussion of the tensor V above). 
This quantity has the advantages that it relies only on a marker (E-Cad—GFP or 
Dlg-GFP) that can be reliably imaged over the necessary length and time scales 
and that the predicted spindle orientations can be computed in a reasonable time, 
even for tens of thousands of cells. In this section, we show how the bipolarity axis 
arises naturally as an approximation to a more detailed description that explicitly 
calculates forces and torques. 

We begin by examining in general terms how a cortical force distribution 
translates into a potential U(@) governing the spindle orientation. In the context 
of this analytic formulation, we assume that the rounded, mitotic cells are 
approximately circles with radius R and centre coinciding with the centre of 
the spindle. Points on the cortex can then be labelled by their angle with the 
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positive x axis. If the spindle makes an angle @ with the positive x axis, then a 
microtubule projecting from one of the spindle poles at an angle ¢ to the spindle 
contacts the cortex at an angle 6 = 6 + y to the positive x axis, where y satisfies 
cot(¢) = cot(y) — e/ sin(w) with e=L /2R. (Extended Data Fig. 6h). With 
this relation, one can translate the integral (see above) over ¢ giving the net torque 
into an integral over the circle of the form 


_ pL ( 7 
T(0) =" J [F(8) + f(8 + x)]#(B— 8) 43 


Similarly, the potential can be written as 


PL Pg) + (8-4 a) 
HO =T JV +A8+ AB 0) a8 


Here, we have suppressed the explicit dependence of the force fon Cmua, and the 
kernel Zi(y/) is a 2n-periodic function given, for —t << 1, by 


e€— cos(y) yl <¥ 
: 1+ €?—2€ cos(w) 
ily) = , 
e— cos() yl > 


1+ €?—2e cos(W) 


where W is related to ® as cot(® / 2) = cot(W) — €/sin(W). 

Importantly, 7(y) is hence a symmetric function whose magnitude peaks at 
w= 0. That is, within our model the net effect of the microtubules is to act as a 
linear filter that smooths out the cortical Mud distribution without otherwise alter- 
ing it. We therefore expect that the spindle should generally prefer to orient itself 
towards the direction of highest Mud concentration but that it is more sensitive to 
relatively wide, broad peaks of Mud than to the fine details of a rapidly varying 
Mud distribution. This idea can be expressed more formally by Fourier transform- 
ing the periodic functions U, fand #. With U(@) = 7, Unexp(in6), and similarly 
for fand u, we have U, = pLf, ii, for n even and U, = 0 for n odd. The magnitudes 
of the coefficients i, are plotted in Extended Data Fig. 6i. As expected, the filter 
decreases the importance of components with higher n relative to the dominant 
n= 2 mode. Thus, the orientation of the nm = 2 mode of the force distribution f(@) 
gives the natural, leading approximation to the location of the minimum of U(8). 
For forces centred on the TCJs (where the Mud concentration is highest), the n = 2 
mode of the TCJ distribution similarly gives the natural proxy for f, and thus for 
the orientation of the mitotic spindle. Moreover, the orientation of this bipolar 
mode can be computed far more efficiently than can the location of the global 
minimum ofa potential including all Fourier modes. Indeed, it is a standard result 
that this orientation corresponds with the anisotropy axis of the second rank 
tensor V defined previously. This makes it especially appropriate for use in our 
tissue-scale calculations. 

Statistics. No statistical methods were used to predetermine sample size. Sample 
sizes vary in each experiment. Statistical significances of protein distribution and 
velocity amplitude were assessed using Student's t-test, the distribution normalities 
were checked using Kolmogorov-Smirnov test. In cases where the variances were 
different, significance was assessed using the unequal variance t-test. The angu- 
lar distribution of velocity was assessed using Watson's U? test for circular data. 
GFP-Mud localization at TC) and GFP—Mud or GFP-MudACC co-localization 
with Gli in the different mutant backgrounds were analysed blindly. Kolmogorov- 
Smirnov tests were used to analyse differences in cag spindle orientations and 
differences in division orientation. P values greater than 0.05 are indicated as not 
significant in figure legends or graphs. Predictions of division orientation based 
on GFP-Mud or GEP-MudACC in mud or wild-type tissues were performed 
blindly. Experiments were not randomized and every experiment was repeated at 
least three independent times. 

Code availability. Matlab code used to segment and track cells has been previously 
published!*. Matlab code used to determine division orientation, cell shape and 
TC) bipolarity upon cell segmentation and tracking are available upon request. 
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Extended Data Figure 1 | See next page for figure caption. 


© 2016 Macmillan Publishers Limited. All rights reserved 


| 8shape - 9division | 


mitosis 


LETTER 


Extended Data Figure 1 | Mud, Pins, Ga; and Gli localization during 
symmetric epithelial cell division in the Drosophila notum. a, b, Within 
the Drosophila pupal notum tissue cells divide according to their 
intephasic cell shape long axis, thereby following the 130-year-old Hertwig 
rule. However, upon entry into mitosis cells round up (the cell shown in 

a, —15 to —2 min and Fig. 3b). a, Time-lapse images of Dlg-GFP ina 
dividing cell (out of 249 cells quantified in b) in the pupal notum tissue 
illustrating cell rounding during mitosis (the same cell is shown as inset in 
Fig. 3b). Prior to mitosis (—30 min) the cell (marked by asterisk) is clearly 
elongated and divides according to its interphasic cell shape (5 min). Upon 
entry into mitosis (—15 min) the cell rounds up and reaches a minimal 
anisotropy just before anaphase (—2 min, see also Fig. 3b). b, Rose plot of 
the difference between the experimental (Ouivision) and predicted division 
orientations by the average (60-30 min before mitosis) interphase cell long 
axis (shape). The data are duplicated relative to 0° line (light green). 
Number of cells (7) analysed is indicated. c-e, Ga; localization in fixed 
epithelial dorsal thorax tissue (c), Pins-YFP localization in pins mutant 
tissue (d) and GFP-Mud localization (e) showing cells in G2 interphase 
(left) and mitosis (right). Ga; is hardly detected at the cell cortex in G2 
phase and is mostly homogenously distributed around the cortex during 
mitosis. Pins-YFP is homogenously distributed around the cell cortex in 
both interphase and mitotic cells. In mitosis Pins-YFP also weakly 
localizes at the mitotic spindle. GFP-Mud localizes at TCs during 
interphase and mitosis (see also f). n = 24 cells (c, left); n= 19 cells 

(c, right); 1 = 80 cells (d, left); n = 12 cells (d, right); n = 111 cells (e, left) 
and 54 cells (e, right). f, GFP-Mud time-lapse images from G2 interphase 


to telophase (n= 21 cells). White arrows, GEP-Mud at TCJs (numbered at 
t= —22 min). Red and yellow arrowheads, GFP-Mud on the spindle and 
its poles, respectively. The same panels —22 min to 4 min are shown in 
Fig. 1a. See also Supplementary Video 1. g, Apical—basal (AB) sections of 
the cell in f at t= —22 min (top) and t= —1 min (bottom). White arrows, 
GFP-Mud at TCJs. n= 21 cells. h, GFP-Mud kymograph along the cortex 
(x axis) from t= —22 to t=0 min of the cell in f. TCJs numbered as in f. 
The kymograph shows that during mitotic rounding GFP-Mud spread 
only modestly along the cortex of the dividing cell. n = 21 cells. i, AB 
sections of GFP-Mud, adherens junction marker E-Cad and septate 
junction marker Dlg (top, n= 16 cells) or septate TC) marker Gli (bottom, 
n= 30 cells). j-m, Localizations of GFP-Mud (white in j-m and green in 
j/’-m”’) and Gli (white in j’-m’ and red in j’’-m”’) in fixed pupal wing 

(j, k) and larval wing disc (1, m) tissues. GFP-Mud co-localizes with Gli at 
TCJs in G2 interphase and mitotic cells in both the pupal wing and larval 
wing disc epithelium. Asterisks mark Mud punctate structures present on 
the nuclear envelope of early G1 cells. Yellow arrows indicate GFP-Mud on 
the spindle poles. n = 20 cells (j, j’”); n=5 cells (k, k’’); n = 63 cells (1, 1’’) 
and n= 12 cells (m-m/”). n, 0, Localizations of Mud (white in n, o and 
green in n”’, o’’) and Gli (white in n’, o’ and red in n”, o’’) detected by 
antibody staining in G2 interphase and mitotic cells in the pupal dorsal 
thorax tissue. As observed for GFP-Mud (Fig. 1b and Extended data 

Fig. 1j-m), the endogenous Mud is enriched at TCJ where it co-localizes 
with Gli in G2 interphase and mitotic cells. Yellow arrows indicate Mud on 
the spindle poles. n = 37 cells (n, n’’) and n = 21 cells (0, 0’”). Scale bars, 
lpm (a, ¢, d-g, i, j, k, 1, m, n, 0), 3 min (h). 
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Extended Data Figure 2 | GFP-Mud localizes at TCJ from G2 interphase 
to mitosis. a, Scheme depicting the accumulation of the Drosophila 
FUCCI reporters during the cell cycle. ECFP-E2F1 accumulates during G1 
phase, G2 phase and mitosis, whereas mRFP1-CycB accumulates during 

S phase, G2 phase and mitosis”°. b, Localization of GFP-Mud (green left 
column and white in the second column panels), mRFP1-CycB (red in 

the left column and white in the third column panels) and ECFP-E2F1 
(blue in the left column and white in the right column panels) in epithelial 
cells of the pupal notum tissue. Confocal sections at the level of septate 
junctions are shown. Cells in G1 (n= 21), S (n=6), G2 (n=35) phases 
and mitosis (n = 6) are indicated in the left panels. During both G1 and 

S phase (upper two rows of panels), GFP—Mud is weakly localized at the 
nuclear envelope membrane, weakly localized at the cortex and at the 
apically localized centrioles (not shown). During G2 phase GFP-Mud 
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mitosis 


‘ OE 
mRFP1—CycB t 


interphase 


becomes prominently localized at the TC] (one cell in the first row of 
panels and two cells in the third row of panels). Arrows indicate examples 
of TCJ GFP-Mud accumulation. During mitosis GFP-Mud remains 
localized at the TCJ and accumulates on the spindle and the spindle pole 
(bottom row panels). Similar results were obtained on fixed tissue for 
which the cell cycle phases were determined using the PCNA S-phase 
marker and the nucleus size to distinguish cells in G1 or G2 interphases 
(not shown). c, GFP-Mud (green arrows) and ChFP-Mud (red arrows) in 
adjacent tissue patches in G2 (n = 31) and mitotic (n = 8) cells. The FLP/ 
FRT system was used to generate adjacent groups of cells labelled with 
either GFP—Mud or ChFP-Mud. By analysing the distribution of GFP- 
Mud in dividing cells adjacent to ChFP-Mud interphasic cells, we found 
that GFP-Mud was localized at the TCJs of the dividing cell from G2 
through mitosis. Scale bars, 1 1m. 
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Extended Data Figure 3 | Regulation of Dlg, Gli and Mud localization by the loss of mRFP-nls, red in c’, n= 13 clones). The loss of Dlg 
in epithelial tissue. a, b, Localizations of Gli (white in a and green ina’, function results in a loss of Gli localization at TCJs. d, Localization of 
n=2 clones) and Dlg (white in b and green in b’, n = 3 clones) in fixed Dlg-GFP (white in d and green in d’) in live epithelial dorsal thorax tissue 
notum tissues harbouring mud clones (identified by loss of mRFP-nls, harbouring a Gli clone (identified by expression of PH-ChEP, red in d’, 
red in a’-b’). The loss of Mud function does not modify the Gli and Dlg n=5 clones). The loss of Gli function does not affect the distribution of 
localizations at septate junctions. ¢, Localization of Gli-YFP (white in c Dlg-GFP at the septate junctions. Scale bars, 5m. 


and green in c’) in live notum tissue harbouring a clone of dlg (identified 
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Extended Data Figure 4 | Astral microtubules contact Mud at the correspond to its orientation at the final time point (see Supplementary 
TCJ. a, a’, Time-lapse images of ChFP-Mud (green in a and top panels Video 2). Similar results were obtained in cells expressing GFP—Mud and 
of a’, white in bottom panel of a’) and of Jupiter-GFP (red in a and top aTub-RFP to label the microtubules (data not shown). b, Schematic of the 
panels of a’, white in middle panel of a’) in dividing cells (n= 11) in laser-ablation assay used to estimate the origin and magnitude of forces 
the Drosophila pupal notum tissue. The panels in a’ are magnifications on astral microtubules required for spindle orientation in the Drosophila 
of the boxed region in a. Yellow arrow indicates an astral microtubule pupal dorsal thorax epithelium. Upon ablation (red lines, top), pulling 

that contacts ChFP-Mud at the cortex and shortens concomitant to the forces (green arrows, left column) or pushing forces (green arrows, right 
spindle pole movement towards the TCJ and spindle rotation. The dashed column) yield recoil away (grey arrow, left column) or towards the ablation 
line corresponds to the initial spindle orientation and the solid lines site (grey arrow, right column), respectively. Scale bars, 1 1m. 
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Extended Data Figure 5 | See next page for figure caption. 
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Extended Data Figure 5 | A large proportion of mitotic spindles remain 
planar in the absence of Mud, Dlg and Dynein function. Loss of Mud 
activity is known to induce defects in mitotic spindle orientation relative 
to the apical—basal axis (AB) of the cell?’**°°. Nonetheless, in mud, dlg, 
dynein (gl?%) mutant cells around 50% of the epithelial cell divide with 

an AB angle (ag) in the range of the wild-type tissue (a—e). Since a 

large proportion of the spindles remain within the plane of the tissue, all 
analyses reported in the manuscript were performed on cell divisions that 
occurred within the plane of the tissue. Furthermore, in a central region of 
the tissue (box in f and f’), 88% of the divisions in mud tissue occur with 
an (az in the range of the wild-type tissue (g). This region was analysed 

to compare TC] bipolarity and cell-shape-based predictions of division 
orientation in wild-type and mud tissue (Fig. 3h). a, AB views of a dividing 
epithelial cell in wild-type (left panel, out of 257 cells quantified in b) or 
mud (right panels, out of 176 cells quantified in b) tissue. The spindle is 
labelled using Jupiter-GFP (green) and the centrosomes using Sas-4—RFP 
(red). aap varies from 0° (spindle parallel to the plane of the tissue) to 90° 
(spindle perpendicular to the plane of the tissue). b, Quantification of aap 
in wild-type, mud and in mud tissue expressing GFP-Mud (mud, GFP- 
Mud). In wild-type tissue, apg varies between 0 and 22° (blue dashed line). 
In mud tissue, 56% of cells divide with a aap angle lower than 22° (dashed 
red lines). The expression of GFP-Mud in mud tissue rescues the spindle 
AB orientation phenotype caused by Mud loss of function. Numbers of 
cells (n) for each genotype are indicated. The distribution of angles in 

mud tissue is significantly different from wild type (P< 1 x 107“), and is 
restored in mud, GFP-mud (P < 1 x 107“). P values, Kolmogorov-Smirnov 
test. c, Quantification of ag in wild-type, Ga; and pins. The loss of either 
Ga, or Pins function does not affect the orientation of the spindle relative 
to the plane of tissue (P > 0.3) in agreement with our findings that Mud 
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localization at TCJs is independent of Pins and Go. The analysis in pins 
tissue confirmed previously published findings**. Number of cells (7) are 
indicated. P values, Kolmogorov-Smirnov test. d, e, Quantification of 
ag in wild-type, Gli and dlg tissues at 25°C (d) and in wild-type and gl>N 
tissues at 29°C (e). Gli loss of function does not affect ap orientation, 
whereas 46% of the dlg cells (P< 1 x 10~*) and 59% of the gl cells 
(P<1x 10~*) divide with aap lower than 22° and 24°, respectively. 
Numbers of cells (1) are indicated. P values, Kolmogorov—-Smirnov test. 

f, f’, Identification of a region of the notum where AB orientation of the 
spindle is not affected in mud mutant tissue. Defects in AB orientation of 
the mitotic spindle result in size asymmetry of the two daughter cells*”. 
Therefore daughter cell size was initially used as a proxy for the magnitude 
of spindle misorientation along the AB axis in mud tissue. The maps of 
daughter cell size asymmetry in wild-type (f) and mud (f’) tissues (green, 
no size asymmetry; purple, strong size asymmetry) revealed that a region 
(highlighted by the black box, f, f”) in the mud notum tissue exhibits 
almost no defects in daughter cell size asymmetry. Accordingly the 
quantification of spindle AB orientation within the region in wild-type 
and mud tissue revealed that 88% of the cells of the region divide within 
the range of the wild-type cells (see g). Anterior is to the right and the 
dashed back line indicates the midline. Colour coding: purple, daughter 
cells with strong size asymmetry; green, daughter cells with normal size 
symmetry; cyan, cells for which no division was detected; grey, cells which 
left the field of view and were not analysed; yellow, macrocheatae; white, 
sensory organ precursors (SOPs). g, Quantification of ag in wild-type 
and mud tissue in the boxed regions in f and f’ was performed as in b-e. 
Numbers of cells (1) for each genotype are indicated. Scale bars, 1 zm (a), 
100m (f, f’). 
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Extended Data Figure 6 | See next page for figure caption. 
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Extended Data Figure 6 | Spindle orientation modelling. a, Mitotic cell 
in the Drosophila pupal notum labelled with Jupiter-GFP to label 
microtubules (1 = 23 cells). White arrows indicate astral microtubules. 
Yellow arrowheads indicate spindle poles. Scale bar, 1 jum. 

b, Representation of the different parameters that were varied for the 
predictions based on the GFP-Mud cortical intensity and shape model to 
estimate their contribution. L, length of the mitotic spindle; N, number of 
astral microtubules; ®, the angle covered by the astral microtubules; anda, 
the GFP-Mud intensity scaling factor. See also Supplementary Table 1. 
c-f, Cumulative plots of the differences between the theoretical spindle 
orientation (theory) and the experimental spindle orientation (6division) 
angles in GFP-Mud-expressing cells (same cells as in Fig. 2h) for different 
spindle lengths (c), microtubule number (d), angular extension of astral 
microtubules (e) and different scaling factor between the GFP-~Mud 
intensity and mechanical pulling force (f). The GFP~Mud model 
predictions are mostly independent of spindle length, the number of astral 
microtubules, the angle covered by the astral microtubules or the scaling 
factor between GFP-Mud intensity and microtubule pulling force. 

g, Dependence of model prediction on shape or GFP-Mud effective 
potential depth (+s.e.m.). The y axis quantitates the difference between 
the theoretical angle (Otneory) and experimental angle (division) (1, aligned; 
—1, perpendicular). A larger potential depth corresponds to more 
deformed cells for the shape model, and to a sharp and anisotropic GFP- 
Mud distribution for the cortical model. Model predictions improve with 
potential depth, suggesting the model can capture the effect of GFP-Mud 
distributions in a dose-dependent manner. n = 140 cells. h, Definitions of 
the angles used in the analytical calculation of the contribution of different 
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harmonics to the potential U(@). The spindle (heavy black line) makes an 
angle @ with the positive x axis. An astral microtubule (thin black line 
indicated by the black arrow) projects to the cortex (circle) at an angle 

¢ with respect to the spindle. The same microtubule contacts the cortex an 
angle = y+ @ above the positive x axis. i, Normalized magnitudes 

|in| / |\@2| of the Fourier coefficients of the kernel (wy) for n even. 

The magnitudes |i,| drop off substantially with increasing n, indicating 
that for many purposes it should be sufficient to approximate the function 
U by its lowest, n = 2 mode. To calculate numerical values for the 

Fourier coefficients, we took the average of the normalized spindle 

length e= L / 2R or the n= 140 cells analysed in this paper, obtaining 

€ 0.76 + 0.03; because it is difficult to precisely estimate ® from the 
available data, coefficients are shown for ® = 180° and 270° in agreement 
with the astral microtubule distribution observed in a. j, Schematic 
illustrating the difference between cell shape and cell TC) bipolarity 
measurements. An elongated cell and a rounded cell are overlaid (left 
panels) and shown side-by-side (middle and right panels). In this example, 
although the two cells have distinct shapes, they have the same TCJ 
bipolarity. The upper panels illustrate the measurement of cell shape, 
which uses all the pixels making up the cell (blue bars). The lower panels 
illustrate the measurement of TC] bipolarity (red bars), which is solely 
based on the angular distribution of the TCJs (red dots), only using the 
unit vectors u, pointing from the cell centre (black dot) to each cell TC]. 
The TCJ bipolarity therefore characterizes TCJ distribution independently 
of cell shape, and a correlation observed between the two quantities is not 
due to a shape bias in the TC] bipolarity measurement. 
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Extended Data Figure 7 | Pins does not contribute to Mud-dependent 
epithelial cell division orientation. a, Rose plots of the difference between 
the theoretically predicted (@neory) and the experimental division (division) 
orientation of the mitotic spindle in pins tissue (orange, left rose plot) and 
wild-type tissue (green, right) based on the GFP-Mud intensity. 

To facilitate the comparison between the left and the right rose plots, 

the data are duplicated relative to 0° line (light orange and light green). 
Number of cells (7) analysed is indicated. P values, Kolmogorov-Smirnov 
test. b, Quantifications of the co-localization of GFP-Mud with Gli in pins 
in metaphase cells (mean + s.e.m.). Number of cells (1) analysed is 
indicated. ns, not significant (Student’s t-test). 
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Extended Data Figure 8 | Structure-function analyses of the Mud 
protein in epithelial cells. a, Diagram of the domains of the Mud protein: 
putative actin binding calponin homology domain (CH, amino acids 
1-246, blue), coiled-coil domain (CC, amino acids 246-1,868, grey), 
conserved Numa/Lin-5/Mud domain (NLM, amino acids 1,968-1,998, 
red), putative transmembrane domain (TM, amino acids 2,456-2,499, 
yellow), 9x repeat domain (amino acids 1,137-1,515), microtubule- 
binding domain (MT, amino acids 1,850-2,039) and Pins binding domain 
(amino acids 1,928-1,982)?*®. GEP- or ChFP-tagged deletion constructs 
and the MudACH constructs were generated by BAC recombineering (see 
Methods for details). The MudAC allele was generated at the mud locus 
using a CRISPR/Cas9 approach (see Methods for details). For each mutant 
allele, its localization at the TJCs and its localization at the spindle pole are 
indicated. b, Localization of the GFP—Mud, MudACH, GFP-MudACC, 
GFP-MudA Pins, GFP~MudATM and in G2 interphase and mitotic mud 
epithelial cells. GFP-Mud, GFP-MudACC, GFP-MudA Pins, GFP- 
MudATM proteins were imaged in living tissue, whereas MudACH was 
localized on fixed tissue using Mud antibodies. GFP-Mud (n= 56), 
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MudACH (n= 33), GFP-MudACC (n= 165), GFP-MudAPins (n = 42) 
and GFP-MudATM (n= 67) interphase cells. GFP-Mud (n= 15), 
MudACH (n= 4), GFP-MudACC (n= 67), GFP—MudAPins (n= 18) and 
GFP-MudATM (n= 11) mitotic cells. c, Localization of the MudAC 
protein (white in the left panels, green in the right panels), Gli (white in 
the panels in the middle and red in panels at the right) and Cora (magenta 
in the right panels) in fixed G2 interphase (n= 71) and mitotic (n= 6) 
cells. The MudAC protein is not enriched at TJCs and its localization at 
the spindle pole is strongly reduced. d, Rose plots of the difference 
between the theoretically predicted (Ojneory) and experimental (division) 
spindle orientation angles in wild-type (left rose plot, green) and mud 
(right rose plot, orange) tissues based on the distribution of GFP- 
MudACC. The right rose plot is identical to the one shown in Fig. 21. To 
facilitate the comparison between the left and the right rose plots, the data 
are duplicated relative to 0° line (light green and light orange). Number of 
cells (n) analysed is indicated. P values, Kolmogorov-Smirnov test. Scale 
bars, 1 um. 
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Extended Data Figure 9 | Predicting cell division based on TCJ 
distribution. a, a’’, Rose plots of the magnitude of the difference between 
experimental (givision) and predicted division orientations by the average 
(60-30 min before mitosis) interphase TC] bipolarity (Orc) or cell long 
axis (@hape) in cells for the indicated |Or¢j — Oshape| intervals. To facilitate 
the comparison between the left and the right rose plots, the data are 
duplicated relative to 0° line (light blue and light red). Kolmogorov— 
Smirnov test (P values), percentage of total cells (1 = 29,388). Panels b and 
b” are identical to panel e in Fig. 3. b, b’’, Rose plots of the magnitude of 
the difference between experimental (Qgivision) and predicted division 
orientations by the average (60-30 min before mitosis) interphase TCJ 
bipolarity (@rcy) or cell long axis (@shape) for the indicated 7,,,,,. intervals. 
To facilitate the comparison between the left and the right rose plots, the 
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data are duplicated relative to 0° line (light blue and light red). 
Kolmogorov-Smirnov test (P values), percentage of total cells 

(n= 29,388). Panels c and c” are identical to panel f in Fig. 3. c, Plot of the 
spindle orientation prediction improvements (colour-coded from dark 
blue to red) based on TC] bipolarity over those based on cell shape versus 
the magnitude of their angular difference (|@rcj — Oshape |) and the cell 
shape anisotropy (7,,.,.): Fhe plot height is the normalized cell number in 
each domain of the plot (29,883 cells were analysed in total). 

As |@rcy — Oshape | increases, the TC] bipolarity predictions improve over cell 
shape prediction for both rounded (low 7),,.,.) and elongated cells (high 
TNehape)” Whereas the rounded cells are characterized by an even 
distribution along the |Orcj — Oshape | axis, the elongated cells are mainly 
characterized by a strongly skewed distribution towards low |@r¢y — Oshape |- 
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a, b, Images of the scutellum tissue before and after ablation (ablated 

region in yellow) in early and late pupa characterized by small isotropic 
stress (a) and high anisotropic stress (b). Tissue stress was estimated by 
determining the initial recoiled velocity upon circular ablation in the x 


and y directions*’, First and last images of two time-lapse movies out of 


the 18 quantified in c are shown. Scale bars, 101m. ¢, Plot of the difference 
between the orientation of TC) bipolarity (@rcy) and principal strain axis 
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Oxy = 0) as estimated up to a prefactor by circular laser ablation. Number 
of ablations (1) analysed is indicated. The same plot is shown in Fig. 4d. 

d, Plot of the difference between the orientation of TCJ bipolarity (Orc) 
and the orientation of strain (Ostrain) as a function of the percentage of cell 
elongation applied to a simulated cell lattice. When cell elongation 
increases TC) bipolarity orientation becomes aligned with the direction of 
cell elongation. Number of simulations (1) analysed is indicated. 
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Structural basis for promiscuous PAM recognition in 
type I-E Cascade from E. coli 


Robert P. Hayes'!, Yibei Xiao!, Fran Ding!, Paul B. G. van Erp*, Kanagalaghatta Rajashankar’, Scott Bailey*, Blake Wiedenheft* & 


Ailong Ke! 


Clustered regularly interspaced short palindromic repeats 
(CRISPRs) and the cas (CRISPR-associated) operon form an RNA- 
based adaptive immune system against foreign genetic elements 
in prokaryotes!. Type I accounts for 95% of CRISPR systems, and 
has been used to control gene expression and cell fate”*. During 
CRISPR RNA (crRNA)-guided interference, Cascade (CRISPR- 
associated complex for antiviral defence) facilitates the crRNA- 
guided invasion of double-stranded DNA for complementary 
base-pairing with the target DNA strand while displacing the non- 
target strand, forming an R-loop*®. Cas3, which has nuclease and 
helicase activities, is subsequently recruited to degrade two DNA 
strands*®’. A protospacer adjacent motif (PAM) sequence flanking 
target DNA is crucial for self versus foreign discrimination**"°. 
Here we present the 2.45 A crystal structure of Escherichia coli 
Cascade bound to a foreign double-stranded DNA target. The 
5’-ATG PAM is recognized in duplex form, from the minor groove 
side, by three structural features in the Cascade Csel subunit. 
The promiscuity inherent to minor groove DNA recognition 
rationalizes the observation that a single Cascade complex 
can respond to several distinct PAM sequences. Optimal PAM 
recognition coincides with wedge insertion, initiating directional 
target DNA strand unwinding to allow segmented base-pairing 
with crRNA. The non-target strand is guided along a parallel path 
25 A apart, and the R-loop structure is further stabilized by locking 
this strand behind the Cse2 dimer. These observations provide the 
structural basis for understanding the PAM-dependent directional 
R-loop formation process!”!8, 

Differentiating ‘self’ from ‘non-self’ is crucial in CRISPR systems, 
as foreign target sequences (protospacers) are identical to sequences 
recorded in the host CRISPR locus (spacers). In type I and II CRISPR 
systems, foreign DNA detection relies on protein-mediated PAM 
recognition**-!®, Whereas PAM recognition in Cas9-based type 
II systems has been attributed to major groove DNA contact!?”®, it 
remains unclear whether Cascade recognizes PAM in the major or 
minor groove’, in single-stranded or double-stranded form*”!. The 
promiscuity in PAM recognition also remains unexplained. Five 
PAM sequences (5’/-ATG, AAG, AGG, GAG and TAG reading from 
the non-target strand) can trigger robust CRISPR interference by 
E. coli Cascade*''!”, Crystal structures of E. coli Cascade in free and 
single-stranded DNA (ssDNA)-bound forms revealed multiple confor- 
mational states and insights about the crRNA-guided ssDNA recogni- 
tion mechanism*!4; however, the mechanisms for double-stranded 
DNA (dsDNA) entry, PAM recognition, and R-loop formation remain 
poorly defined. 

To understand PAM-dependent foreign DNA recognition, we 
determined the 2.45 A crystal structure of E. coli Cascade bound to 
dsDNA that forms a partial R-loop (Fig. la-c and Extended Data 
Fig. 1); such DNA substrates are known to bind Thermobifida fusca 


type I-E Cascade and to recruit Cas3 specifically”’. Our structure 
agrees well with the cryo-electron microscopy (cryoEM) recon- 
struction of the full R-loop-Cascade complex, underlining its valid- 
ity in explaining R-loop formation”! (Extended Data Fig. 2 and 
Supplementary Video 1). Comparison with high-resolution crystal 
structures suggests that R-loop formation requires both the sliding 
of Csel-CTD (carboxy-terminal domain)-Cse2.1-Cse2.2, as seen in 
the ssDNA-bound Cascade structure, and the engagement of Csel- 
NTD (amino-terminal domain), as seen in the free Cascade struc- 
ture”!”54 (Extended Data Fig. 3 and Supplementary Videos 2 and 3). 
In addition, a localized conformational change, only observed in this 
R-loop-bound structure, occurs near the putative Cas3-binding site?!, 
this may have a role in Cas3 recruitment (Extended Data Fig. 4). 
dsDNA enters Cascade between Cas7.5 and Cas7.6, contacted by the 
lysine-rich helices'®”¢ (Fig. 1d, e and Extended Data Fig. 5). DNA bifur- 
cates underneath PAM. The entire target DNA strand flips to form the 
segmented DNA-crRNA duplex’’. The 10-nucleotide non-target strand 
is guided to a parallel path 25 A apart by sequence-nonspecific contacts, 
an active mechanism to stabilize the R-loop (Fig. 1d, e). Modelling 
dsDNA beyond PAM projects it across the Csel-CTD without severe 
steric clashes (Fig. 1f), illustrating a possible PAM-searching scenario”’. 

The 5’-ATG PAM sequence is recognized by Csel in the 
double-stranded form, from the minor groove side (Fig. 2a, b). This 
mode of recognition strongly biases towards the target DNA strand, 
which rationalizes previous observations that mismatched PAMs 
could be tolerated, provided that the target strand sequence was opti- 
mal*?!. PAM recognition requires three structural features in Cse1: 
a glutamine wedge, a glycine loop and a lysine finger (Fig. 2a, b). Only 
Cr_1-Gyr_1 is tolerated at the —1 position (PAM—1) in E. coli>', 
although recent analyses revealed spacer-dependent tolerance of alter- 
native base pairs at PAM—1 (ref. 28). In our structure (Fig. 2a—d), the 
amide of Ala355 in the glutamine wedge donates a H-bond to O2 of 
Cr_1, specifying a pyrimidine in the target strand. The carbonyl of 
Gly157 in the glycine loop accepts a water-mediated H-bond from N2 
of Gyr_1. Gnr_1-to-inosine substitution suggests that this contact only 
provides minor discrimination against Ayyr_) (Extended Data Fig. 6a). 
The affinity of Cascade for different PAM—1 base-pair combinations 
correlates well with the structural observation that a target strand 
pyrimidine is strongly specified (Extended Data Fig. 6b). The tenfold 
differences in the dissociation constant (Kg) values, however, do not 
support an absolute Cy_|-Gyr_, specification at PAM—1. Indeed, 
Cas3 can cleave alternative PAM—1 targets, provided that the Cascade 
concentration is above the corresponding Kg value (Extended Data 
Fig. 6c). These results echo the recent finding in suggesting that the 
PAM-~—1 readout is further complicated by the Cascade/Cas3 expres- 
sion level and spacer content”®. 

Importantly, PAM recognition coincides with insertion of a glu- 
tamine wedge into dsDNA underneath PAM (Fig. 2a, d). The tip 
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Figure 1 | Foreign-DNA-bound Cascade structure. a, Arrangement 
of I-E CRISPR-cas locus in the E. coli K12 genome. Colour schemes 
are preserved throughout all figures. b, c, Nucleic acid sequences (b) 
and overall views (c) of the foreign- DNA-bound Cascade structure. 
d, Entry of dsDNA between Cas7.5 and Cas7.6, PAM recognition by 


residue Gln354 stacks underneath Cr_; and, together with Asn353, 
sterically displaces the first two protospacer nucleotides in the target 
strand, forcing them to rotate outwards. Given its strategic location, it 
is rather surprising that this wedge is not highly conserved in sequence 
(Extended Data Fig. 7). Indeed, tip residue substitutions (Gln354Ala, 
Asn353Ala and Gln354Ala/Asn353Ala) barely affect DNA binding. By 
contrast, trimming this wedge (NNQAS352-356/GG) reduced DNA 
binding 100-fold, suggesting that the wedge functions bya steric inter- 
ference mechanism to nucleate the target strand displacement upon 
PAM recognition (Fig. 2e, f). A serine-to-phosphate ‘lock is essential 
in initiating the target strand flipping in Cas9 (ref. 19). Thr125 is in 
a similar location in our structure but contacts the +1 bridging oxy- 
gen instead, and the Thr125Ala substitution had a negligible defect 
(Fig. 2e). 

Recognition at PAM—2 is promiscuous. Only Gr_2-Cynr-_z is 
rejected at this position; the other three combinations lead to effi- 
cient interference”. Here the glycine-loop residues (159-161) assume 
a lip-like structure, introduce DNA bending at Ay_.-Tyr_, and bite’ 
onto the PAM—1 base pair in conjunction with the glutamine wedge 
underneath; Tyy_2 retreats backwards and tilts upwards (Fig. 2b, d). 
The rim of the glycine loop explores shape-complementarity to Ar_», 
and donates a weak H-bond to N3 of Ay_». Gly160Ala substitution 
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three structural elements in Csel-NTD (magenta), and the partial R-loop 
underneath PAM. e, Schematic of Cascade-DNA contacts around PAM 
regions. Hydrogen bonds and electrostatic contacts are shown as dashed 
lines, hydrophobic interactions as solid lines, and water molecules as blue 
circles. f, Modelling of Cascade sampling B-form dsDNA for PAM. 


disrupts the shape complementarity, and reduces Cascade binding 
affinity by ~100-fold and Cas3 cleavage to baseline levels (Fig. 2e, f). 
Rejection of Gy_.-Cyy_2 at PAM-—2 is rationalized by the fact that 
the N2 amine of Gy_, would introduce steric clashes against the gly- 
cine loop; whereas T7_2-Anr-_2 or Cr_2-Gnr_2 would not, based on 
modelling (Extended Data Fig. 8a). Indeed, removal of this amine by 
inosine substitution largely rescued the DNA-binding defect, confirm- 
ing that the N2 of Gr_. is the anti-determinant for PAM—2 specificity 
(Extended Data Fig. 8b). An equivalent glycine-rich loop is present in 
all known Csel structures; they probably have a similar minor groove 
DNA recognition function (Extended Data Fig. 7). 

PAM -3 is typically specified as a pyrimidine;—purineyy pair by 
Cascade*!!!, 5/-TAG PAM also leads to interference, but Gr_3—-Cnr_3 
containing PAMs fail to””®. Here a favourable electrostatic interac- 
tion from a lysine finger (Lys268) to O2 of Ty_3 is observed, underly- 
ing the strong preference for pyrimidine at this position (Fig. 2b, d). 
Lys268Ala mutation reduced Cascade binding and Cas3 cleavage by 
>8-fold and >10-fold, respectively, emphasizing its positive contri- 
bution to PAM—3 recognition (Fig. 2e, f). Interestingly, the Lys268Ala 
mutant still retained wild-type level discrimination against 5’-CTG 
PAM (Extended Data Fig. 8c). Gr_3-to-inosine substitution ultimately 
proved that the N2 of Gy_; also serves as a strong anti-determinant 
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Figure 2 | PAM recognition by Csel subunit of Cascade. a, b, Detailed 
views featuring PAM recognition by the glutamine wedge (and its 
involvement in target DNA strand displacement), glycine loop, and lysine 
finger of Csel. The 2F, — F, electron density map is shown at 1.50. 
c, Summarization of the five interference PAMs in E. coli type I-E CRISPR 
system, with the observed Cascade contacts marked. d, Top-down views 
of PAM recognition at each base pair. Left, stringent recognition of 


in the rejection of Gr_3-containing PAMs (Extended Data Fig. 8d). 
Interestingly, Lys268 makes an electrostatic interaction to Cr_4 
(Fig. 2b, d), indicating certain level of sequence discrimination at 
PAM—4as well. 

Detailed structure dissection also helps to rationalize the self-avoid- 
ance mechanism. All spacers in E. coli CRISPR loci are ‘protected’ 
by a 5’-CCG PAM. This PAM is the combination of the least- 
preferred nucleotides at each position (3’-Gy_3Gy_2Cy_;), which 
would strongly disfavour Cascade-mediated R-loop formation, despite 
a perfect spacer match. 

The non-target strand is guided sequence-nonspecifically along the 
Cascade surface, 20-25 A away from the target strand (Fig. 3a). The 
sugar phosphates of nucleotides 1-3 are contacted by Lys163, Gly169 
and Lys296 of Csel-NTD, nucleotides 6-9 by a string of positive 
charges (Arg393/Lys394/Lys488/His489/Arg491) across Csel-CTD 
(Fig. 3a). The redundant interactions were not disrupted by a point 
mutation (Tyr397Ala). A double mutant (488-489KH/AA) neutraliz- 
ing a positive patch, however, led to a fourfold binding defect (Fig. 3b). 
The tenth/last nucleotide rests at an intersection between Csel-CTD 
and Cse2.1. To investigate whether the following non-target residues 
travels on the surface or backside of the Cse2 dimer, we further deter- 
mined a 3.2A structure in which the non-target protospacer is 22 
nucleotides longer. Although most of the additional residues remain 


CGGTATTGTTCAGATCCTGGCTTGCCAACAGCTGCAGGTCGACAAGCTTGCGGCCGC 3’ non-target 


PAM~—1. Middle, shape complementarity to PAM—2. Right, Electrostatic 
contacts to PAM—3 and —4. e, f, Mutagenesis assayed by Cascade-binding 
(electrophoretic mobility shift assay, EMSA) (e) and Cas3-mediated 

DNA cleavage (f) to evaluate the observed PAM contacts. WT, wild type. 
Experiments were done in triplicate and representative results are shown. 
Uncropped gels are shown in Supplementary Fig. 1. 


unresolved, density clearly reveals that the non-target strand residues 
take the downward trench route towards the backside of the Cse2.1 
dimer, attracted by the favourable electrostatic environment therein 
(Fig. 3c-e). Non-target strand sequestration probably corresponds to 
the extra ‘locking’ step after most R-loop forms, as a mechanism to 
prevent the R-loop collapse!” 

In summary, our structural analysis provides important insights 
about the PAM-dependent directional R-loop formation” (Fig. 4). 
Recognition of an interference PAM by Cascade coincides with the 
wedge- mediated displacement of the first two target strand nucleo- 
tides, initiating DNA unwinding. Directional DNA melting ensures 
the ordered guidance of the non-target DNA strand ~25 A away from 
the target strand, as a mechanism to stabilize the seed bubble. Further 
R-loop propagation leads to non-target strand sequestration behind 
the Cse2 dimer, locking the R-loop in place. Conformational changes 
accompany the process and reorganize the Csel surface, paving the 
way for Cas3 binding. The active guidance of the non-target DNA 
strand is a theme not observed in Cas9-DNA structures'””°. It ration- 
alizes the observation that the Cascade-bound R-loop is considerably 
more stable than that of Cas9 (ref. 17). Besides 5 interference PAMs, 
21 other PAMs stimulate ‘primed adaptation in E. coli”, in which 
Cascade and Cas3 actively recruit the Cas1/Cas2 spacer acquisition 
complex’®*°, Priming PAMs may lead to suboptimal Cascade contacts 
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Figure 3 | Active guidance of the non-target DNA strand by Cascade. 
a, Sequence-nonspecific electrostatic contacts guide the 10-nucleotide 
non-target strand (red sticks) protospacer along the Csel-NTD surface 
(cyan), and across a binding site on Csel-CTD (green). Tyr397 and Leu481 
from Csel-CTD further interdigitate between di-nucleotide stacks. 

b, EMSA evaluating non-target contacts by Cascade. Experiments were 
done in triplicate and representative results are shown. c, d, Nucleic acid 
sequence (c) and zoomed-in view of the 3.2 A structure (d) of Cascade 
programmed with a 22-nucleotide (nt) longer non-target strand. The 
2F, — F, electron density map at 1.0c clearly reveals that the longer non- 
target DNA strand takes the trench route. Inset illustrates the favourable 
electrostatic surface. Uncropped gels are shown in Supplementary Fig. 1. 


and non-canonical R-loops. Such R-loops are difficult to form and 
may not be completely unwound’, requiring higher Cascade con- 
centration*’ and favourable DNA torque’’. They also fail to recruit 
Cas3 directly*, which may indicate that the non-target DNA strand is 
misguided, as Cas3 recruitment is contingent upon non-target strand 
contact as well*’. Overall, these results will facilitate future studies to 
better understand the interference and primed adaptation mechanisms 
in type I CRISPR-Cas systems. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Expression and purification of Cascade, the Cascade-dsDNA complex and 
Cas3. Sequence information for primers and the synthetic CRISPR expression 
cassette can be found in Extended Data Table 1. Cascade expression was similar to 
ref. 26. In brief, csel] was PCR amplified from E. coli K12 genomic DNA and cloned 
into the pRSF-Duet-ORF1 vector (Kan¥), between Ncol and NotI restriction sites 
(Extended Data Table 1). The cse2-cas7-cas5e-cas6e sub-operon was cloned into 
pET52b (Amp®) between Ncol and NotI; as a Precission cleavable Hiss fusion at 
the N terminus of cse2. The pre-crRNA expression cassette was synthesized by 
Life Technologies and cloned into the pHSG-398 vector (Cam®) (Extended Data 
Table 1). E. coli BL21 (DE3) star cells containing the three plasmids were grown 
in LB medium at 37°C to Agoonm of 0.6. Cascade expression was induced by the 
addition of 0.5 mM isopropyl-6-p- 1-thiogalactopyranoside (IPTG) and cells were 
further cultured at 20°C for 12h. 

The cells were disrupted by sonication in buffer A (50 mM Tris, pH 8.0, 20 mM 
imidazole and 300 mM NaC)), loaded to Ni-NTA column, and eluted with buffer 
A supplemented with 300 mM imidazole. The His tag was cleaved by Precision 
protease, and back-adsorbed with a second Ni-NTA column binding step. Cascade 
was concentrated and buffer exchanged into buffer B (20 mM Tris, pH 7.5, 150mM 
NaCl and 2mM DTT), and further purified on Superdex 200 prep grade column 
(GE healthcare). Free-Cascade containing fractions were pooled, concentrated to 
15mg ml~ ! flash-frozen, and stored at —80°C. 

The Cascade-dsDNA complex was prepared by mixing free-Cascade with 
dsDNA R-loop mimicking substrate. DNA substrates were chemically synthesized 
from IDT (Extended Data Table 1). The non-target strand was annealed with the 
target strand at a 1.5:1 molar ratio. The resulting R-loop mimicking substrate was 
mixed with Cascade at a 2:1 molar ratio, incubated at room temperature for 30 min, 
and re-purified on Superdex 200. Cascade-dsDNA complex fractions were pooled 
and used in crystallization trials. The Cascade-dsDNA complex containing the 
32-nucleotide non-target strand overhang was also obtained using the above proto- 
col, except that the His tag was not cleaved. Cascade mutants were constructed with 
site-directed mutagenesis, and purified using the same method as free-Cascade, 
except that the N-terminal His, tag was left intact. Cascade integrity was checked 
using SDS-PAGE (Extended Data Fig. 4b). 

The Cas3 gene was amplified from E. coli K12 genomic DNA and cloned 
between BamHI and Xhol into the pET28a-SUMO plasmid. E. coli BL21 (DE3) 
star cells were grown in LB medium at 20°C to Agoonm Of 0.3, induced with 0.2 mM 
IPTG, and further cultured at 20°C for 12h. The Ni-NTA and SEC purification 
procedures were similar to the procedure mentioned above. The monomeric 
SUMO-Cas3 fractions were pooled, concentrated to 2 mg ml“!, flash-frozen and 
stored at —80°C until usage in biochemical assays. 

Crystallization and structure determination of Cascade-partial-R-loop 
complex. Cascade complex crystals were grown using the hanging drop vapour- 
diffusion method by mixing 2,1 of purified Cascade-dsDNA complex (15 mg ml") 
with 2 tl of mother liquor (1.6 M Na/K-phosphate, pH 6.2) at 18°C. Initial crystals 
appeared after 2 weeks and grew to full size after ~6 weeks. Crystals were cryopro- 
tected in motherliquor supplemented with 20% ethylene glycol and flash frozen 
in liquid nitrogen. Diffraction data were collected at Advanced Photon Source 


NECAT beamline 24-ID-C and were processed with HKL2000 (ref. 31). The struc- 
ture was solved by molecular replacement with PDB accession 4QYZ as the search 
model. Iterative model building and refinement was conducted with COOT* and 
PHENIX*?. A summary of the diffraction and refinement statistics can be found 
in Extended Data Table 2. The Ramachandran plot for the Cascade-dsDNA 
10-nucleotide overhang structure indicated 96.65% of residues in the favoured 
region, 3.10% allowed, and 0.25% outliers. The Ramachandran plot for the 
Cascade-dsDNA 32-nucleotide overhang structure indicated 94.56% of residues 
in the favoured region, 4.81% allowed, and 0.63% outliers. Figures were gen- 
erated using PyMOL (The PyMOL Molecular Graphics System, version 1.3r1, 
Schrédinger, LLC) and CCP4mg (ref. 34). 

EMSA. Fluorescent dsDNA target substrates were generated for biochemical 
assays. The crRNA-matching targets with varied PAMs were cloned into pCDF- 
Duet between the PstI and Ncol sites Extended Data Table 1. The dsDNA1 and 
dsDNA2 substrates were PCR amplified from the plasmid using the indicated 
fluorescent oligonucleotides (5’ 6-FAM for the non-target strand and 5’ Cy5 for 
the target strand). The dsDNA3 substrates were prepared by oligonucleotide 
annealing. All dsDNA substrates were subsequently gel-purified. The dsDNA1 
substrate (5‘ATG-PAM) was used for all main-text EMSA and Cas3 cleavage assays. 
The dsDNA2 substrates were used for the experiments shown in Extended Data 
Figs 6b, c and 8c. The dsDNA3 substrates were used in the experiments shown 
in Extended Data Figs 6a and 8b, d. DNA binding was conducted in 20 mM Tris, 
pH 7.5, 150mM NaCl and 5% glycerol. The dsDNA substrate concentration was 
held constant at 3nM and Cascade concentration was titrated as indicated. The 
Cascade and dsDNA were incubated at 37°C for 30 min in 20 mM Tris, pH 7.5, 
150mM NaCl and 5% glycerol. EMSA was carried out at 4°C on 2% agarose gels. 
Fluorescent signals were scanned using a Typhoon 9200 scanner. The EMSA 
gels shown in the main and Extended Data figures are representative results, 
out of triplicated experiments. The raw, uncropped source data can be found in 
Supplementary Fig. 1. 

Cascade-mediated Cas3 DNA cleavage assay. Cascade-R-loops were pre-formed 
by mixing 40 nM Cascade or Cascade mutants with 6nM fluorescent dsDNA 
target in binding buffer (5 mM HEPES, pH 7.5 and 60mM KC]) at 37°C for 30 min. 
Cascade-R-loops were then mixed with 500 nM SUMO-Cas3 in DNA cleavage 
RXN buffer (5 mM HEPES, pH 7.5, 60mM KCl, 10mM MgCh and 10|1M CoCh). 
Either 2mM ATP or 2mM AMPPNP was added and the reaction was incubated 
at 37°C for 30 min. Cy5 and 6-FAM fluorescent signals were recorded by Typhoon 
9200 scanner. The wild-type E. coli Cascade specifically nicked the non-target 
DNA strand ~10-12 nucleotides into the R-loop region in the presence of a 
non-hydrolysable ATP analogue, AMPPNP. Addition of ATP triggered proces- 
sive degradation of the non-target DNA strand and distributive degradation of the 
target strand upstream of the R-loop. These results are consistent with previous 
studies!!! Uncropped gels are shown in Supplementary Fig. 1. 
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Extended Data Figure 1 | Electron density of nucleic acids in Cascade-dsDNA complex structure. 2F, — F, electron density map (1.0). Nucleic acid 
strands are shown as sticks and coloured as previously indicated. 
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Extended Data Figure 2 | Comparison between the Cascade-partial-R-loop crystal structure and the Cascade-full-R-loop EM reconstruction. 
Rigid body docking of the partial-R-loop—Cascade crystal structure into the EM reconstruction (EMD-5929) of the full R-loop—Cascade illustrates a 
similar overall conformation, with a correlation value of 0.83. 
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Extended Data Figure 3 | Conformational changes in Cascade upon at the tip of the Csel-CTD slides with the Cse2 dimer ~12 A relative to the 
partial R-loop formation. Comparison of free (PDB 4TVX; in a darker Cas7 scaffold, and protrudes upwards into the R-loop. Csel-NTD remains 
shade) and partial R-loop-bound (this study, in lighter shade) Cascade. in the docked position, stabilized by the clamping of Csel L1 loop onto the 
Arrows indicate the direction of the movement. The CTD of Csel pivots exposed tri-nucleotide motif on crRNA 5/-handle. 


30° about a hinge at the Csel1 NID-CTD interface. The amplified motion 
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Extended Data Figure 4 | Local conformational rearrangement in Csel 
at the proposed Cas3-binding site. a, Vector map showing global and 
local conformational changes in Csel. Csel-NTD (in cyan) undergoes 
moderate movement. Csel-CTD (in green) swings about a pivoting point 
(Trp307) as part of the global conformational changes, The Csel 
NTD-CTD interface (in orange) undergoes considerable local 
conformational rearrangement upon partial R-loop formation. 

b, SDS-PAGE of wild-type and mutant Cascades used in mutagenesis. 
All mutants contained stoichiometric amounts of Csel, except Trp307Ala, 
in which Cse1 failed to assemble. Uncropped gels are shown in 
Supplementary Fig. 1. c, Detailed structure rearrangement at Csel 
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Cse1 (56 kDa) 


Cas7 (40 kDa) 


Cas5e (25 kDa) 
Cas6e (22 kDa) 


Cse2 (19 kDa) 


Cse1 NTD/CTD interface 


Cas3 


NTD-CTD interface, involving amino acids 300-326. Different colours 
are used to differentiate structural elements (amino acid numbers 
tabulated to the right). Partial R-loop-bound and free Cascade structures 
are rendered in solid and semi-transparent cartoons, respectively. 

d, Trp307 from Cse1-NTD rotates inside a hydrophobic socket at 
Csel-CTD during the conformational change (free-Cascade-CTD 
rendered in a darker shade of green). Disruption of this interaction 
(Trp307Ala) caused Csel dissociation from Cascade. e, Docking of our 
structure into the cryoEM reconstruction of the crosslinked Cascade- 
dsDNA-Cas3 complex (EMD-5930). Cas3 density and the location of the 
local rearrangement in Csel are circled to highlight their proximity. 
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Extended Data Figure 5 | Guided entry of dsDNA into Cascade. A series of positively charged residues from Cas7.5, Cas7.6, Cas5e and Cse1 (dark blue 
surfaces) guide the dsDNA into Cascade and towards PAM recognition elements in Csel-NTD. 
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Extended Data Figure 6 | Influence of PAM—1 base-pair composition be ~10, 20, 40 and 100 nM, respectively. c, Cas3 cleavage efficiency was 


on Cascade binding affinity and Cas3 cleavage efficiency. a, Inosine governed by the Cascade affinity for the corresponding PAM-containing 
substitution of Gyr_; led to a mild twofold binding defect, suggesting that _ target (5‘-ATG > ATA >> ATT = ATC). Cascade concentration above 
the N2 amine of Gyr_; is a minor determinant of specificity. b, Binding the Ky led to efficient Cas3 cleavage. Experiments were done in triplicate 
Kg values of E. coli Cascade for all four base-pair combinations at PAM—1. and representative results are shown. Uncropped gels are shown in 

Kj values for 5'-ATG, ATA, ATC and ATT PAMs were determined to Supplementary Fig. 1. 
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A. ferrooxidans THERE MSL ILVPWOYRNLIRGGEDDVPAW 234 

T. fusca YE ILVAFDTDNLIVTPED-RPAW 235 

T. thermophillus ILE BSL LIVELY TPR =---=-— EDDAPIW 215 

E.coli NQPTWIKPIKSNESIPASSI@FVRGLFWQPAHI EIECDPIGIGKCSCCGQESNLRYTGFLK 266 

A. ferrooxidans BRo-=—- DPLGPTRVMRAPDEVCDLFTMOGRRIRBFP--ERRGDAIVVPRVLICAGDEVD 287 

T. fusca RQPPTTAAPADDEELAQRPY@LCDLYTWOSRRI l'Y--DADG----VYGVLLAYGDPLA 289 

T. thermophillus EVPPLRLGDLEGARTKWPLTERTRVYTJPARG’ ILD--EGDG----~-~ VRFEMGYGPGVE 267 

E.coli EKFTFTVNGLWPHPHS PCLVTVKKGEVEEKFLAFTTSAPS(§iTOISRVVVDKIIQNENG-- 324 

A. ferrooxidans RRAARDVDPHVGWR---—-MESRRGAEVSYVPLRARPGQQ’ GLSSVLA---LGAEEQ-- 338 

T.fusca PHNKHNHEPMTAWRRS PAQEKKLKKPQVYLPREHDPTRS. GLGALVAGEASGAEQRGE 349 

T. thermophillus PLEATHRDPMVAQR-~---~~— LDAKGNLLVLRLSEERS DFSAMLP----RQGGK-- 313 

= 

E.coli 

A. ferrooxidans 

T. fusca 

T. thermophillus 

E.coli YGNVINEIVTVGLGYKTALRKALYTFAEGFKN-------------------- KDFKGAGV 413 

A. ferrooxidans QE--DPAAATVAIDAVTFAAHAAQALGYVAEARYLSYDLSFHEESKRHRVPEGKAALAKA 447 

T.fusca ER--DSGLGRTAIKAVEDAEKAVTVLGGLAAD-— LAKAAGADPET 450 

T. thermophillus PK--AEENLEKALKMAEELGOGLKHLAQEVAK~~~~~~-~~~~~~~~~~~~~~~~ AVVGER 403 

a aan 

E.coli SVHETAERHFYRQSELLI PDVLANVNFSQADEVIADLRDKLHQLCEMLFNOSVAPYIAHHP 473 

A. ferrooxidans ARSALAEELYGRLDAPYRHFLTGLANI DDLERPRAEWAALVEAVARDLASRELAQLAPAQ 507 

T. fusca PRAAARDRGFGMLDGPFRTIWLATLAPGTDATERRRAWQQKAHRIISDLG-ROLVAEMGEA 509 

T. thermophillus DRGHGRSPYLEELTKLANSLPLERLYWHALDGAF PRFFARVEEEAS LDLWREALR\ LE 463 

we * Hee 

E.coli ---TLALARATLYKHLR@LKPQGGPSNG---- 502 

A. ferrooxidans -VAGEDRFRRMLARARNFSPSDSPEKGAA-- 540 

T. fusca PMATSEQTGEAAS 555 

T. thermophillus AWKATR----~-~~—! RFLGTGARHLKALAQGEOIQFGRLLGELGEEVRT 502 

* one 
b 
E. coli 
T. thermophillus 
A. ferrooxidans 
T. fusca 

Glycine loop region (ds-PAM minor groove) Glutamine wedge region 
Extended Data Figure 7 | Conservation of PAM recognition elements. thermophilus (PDB 4AN8), Acidithiobacillus ferrooxidans (PDB 4H3T) 
a, Sequence alignment of Cse1 proteins with known structures. Identical and T. fusca (PDB 3WVO) were superimposed with the Cse1 in our partial 
residues are highlighted in red, conserved residues in red text. R-loop-forming E. coli Cascade structure. Despite sequence variation, a 
DNA-contacting residues in the Csel-NTD and -CTD are marked by cyan _ glycine-rich loop is present in each Csel structure, and probably has a 
and green asterisks, respectively. The glycine loop and glutamine wedge similar function to recognize PAM from the minor groove (left inset). 
are in blue and orange boxes, respectively. Residues mediating the Trp307 The glutamine-wedge protrusion is highly conserved in 3D. Each wedge 
ball-and-socket interaction are marked with black asterisks. b, Structural features a long side chain at the tip (right inset), which probably stacks 
alignment of Csel PAM recognition elements. Apo Csel from T. Thermus underneath PAM in a similar fashion. 
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Extended Data Figure 8 | Rationalization of PAM—2 and PAM—3 c, Whereas Lys268Ala contained reduced affinity for the correct PAM, it 
specificity using nucleotide substitution and modelling. a, Modelling of _ still possessed strong discrimination against Gy—-Cyr7 at PAM—3 (5/-CTG), 
alternative base pairs at PAM—2 suggests that only the N2 amine of Gr_2 suggesting the further presence of a mechanism to reject Gr_3. d, Inosine 
would cause steric clashes with C,, of Gly160 (bottom right quadrant), substitution of Gr_3 restored the Cascade binding Ky to ~40 nM, leading 
this amine therefore may serve as the anti-determinant for the rejection to the conclusion that the N2 amine of Gr_3 is a minor determinant of 
of Gr-Cyr at PAM—2. b, EMSAs demonstrating that removal of specificity. Experiments were done in triplicate and representative results 
this amine in inosine substitution rescued the Cascade binding defect. are shown. Uncropped gels are shown in Supplementary Fig. 1. 
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Extended Data Table 1 | Oligonucleotides used in this study 


Oligonucleotide 
Csel (forward) 


Cse1 (reverse) 
Cse2-Cas7-CasSe-Cas6e 


(forward) 


Cse2-Cas7-CasSe-Cas6e 
(reverse) 


Cse2-Cas7-CasSe-Cas6e 
(forward) 


R-loop mimic (target strand) 


R-loop mimic (non-target 
strand) 


SUMO-Cas3 (forward) 
SUMO-Cas3 (reverse) 


Target plasmid construction 
(non-target strand) 


Target plasmid construction 
(target strand) 


dsDNA1 Fluorescent ATG PAM 
substrate (forward) 


dsDNA1 Fluorescent ATG PAM 
substrate (reverse) 


dsDNA2 Fluorescent substrate 
(forward) 


dsDNA3 (non-target strand) 


dsDNA3 (target strand) 


crRNA expression cassette 


Notes 
Ncol + Cse1 fragment (pRSF-Duet ORF1) 


Sequence (5'-3') 


GCGCGCCATGGCTAATTTGCTTATTGATAACTGGATCC 
GGCCCGCGGCCGCTCAGCCATTTGATGGCCCTCC Cse1 fragment +Notl (pRSF-Duet ORF1) 


GCGCGGGTACCAGATGGCTGATGAAATTGATGCA Kpnl + Cse2-Cas7-Cas5e-Cas6e fragment (pET52b) 


GCGCGCCATGGGTCATCACCACCATCATCACGGTGCACTTGAAGTCCTCTTTC 
CTGTTGGCAAGCCAGGATCTGAACAATACCGTCATCGAGCACTGCACAGA 


TCTGTGCAGTGCTCGATGTTTTATTTAT 
GCGCCGCGGATCCATGGAACCTTTTAAATATATATGCCAT 
GCGCCGCCTCGAGTTATTTGGGATTTGCAGGGAT 


ACGGTATTGTTCAGATCCTGGCTTGCCAACAGCTGCA 


Ncol + 6XHIS + Precission (pET-52b) 


BamHI + Cas3 fragment (pET28b-SUMO vector) 


Cas3 fragment + Xhol (pET28b-SUMO vector) 


CATGG: 


Ncol overhang + Target sequence + Pstl overhang 
(pCDF-Duet1) PAM altered as indicated in text 


GCTGTTGGCAAGCCAGGATCTGAACAATACCG' Pst! + non-target sequence + Ncol (pCDF-Duet1) 


PAM altered as indicated in text. 


5’- 6-FAM label, PCR product used in main text EMSA 
and Cas3 cleavage assays. 


AACTTTAATAAGGAGATATACCATGGATG 
GCGGCCGCAAGCTTGTC 


TAATACGACTCACTATAGGG 


5’- Cy5 label, PCR product used in main text EMSA and 
Cas3 cleavage assays. 
T7 Forward 5’-6-FAM label. Used with dsDNA11 (reverse) 
primer to amplify set of 5’-ATA, ATT, ATC, ATG, AGG and 
CTG PAM substrates from target plasmids. 


ACGGTATTGTTCAGATCCTGGCTTGCCAACAGCTGCAGGTCGACA 


TGTCGACCTGCAGCTGTTGGCAAGCCAGGATCTGAACAATACCG (CCATGTATATCT 


GCGCCGGGAATICCCTGCATIAGG 
GACGGTATTGTTCAGATCCTGGCTTGCCAACAGGAGTTCCCCGCGCCAGCGGGGATAAACCGACGGTATTG 

TTCAGATCCTGGCTTGCCAACAGGAGTTCCCCGCGCCAGCGGGGATAAACCGACGGTATTGTTCAGATCCTG 
GCTTGCCAACAGGAGTTCCCCGCGCCAGCGGGGATAAACCGACGGTATTGTTCAGATCCTGGCTTGCCAAC 

AGGAGTTCCCCGCGCCAGCGGGGATAAACCGACGGTATTGTTCAGATCCTGGCTTGCCAACAG' 


5’ 6-FAM or HEX label, altered (base mutation or 
inosine substitution) as indicated in text. 


altered (base mutation or inosine substitution) as 
indicated in text. 


The crRNA expression cassette containing six identical 
repeat-spacer units was de novo synthesized and 
inserted to the pHSG-398 vector. The T7 promoter, T7 
terminator, and the repeats are highlighted green, red, 

and yellow, respectively. 


(CGCGCCAGCGGGGATAAACCGACGGTATIGTTCAGATCCTGGCTTGCCAACAG 


IGGACCGGATCCCCG 
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Extended Data Table 2 | Data collection and refinement statistics 


Cascade-dsDNA 


Cascade-dsDNA 
(32-nt non-target spacer) 


Data collection 
Space group 
Cell dimensions 

a, b, c (A) 

a, B,y (°) 
Resolution (A) 
Rmerge 
Roim 
T/ol 
Completeness (“%) 
Redundancy 
CC(1/2) 


Refinement 

Resolution (A) 

No. reflections 

Rwork / Rite 

No. atoms 
Macromolecule 
Ligand/ion 
Water 

B-factors 
Macromolecule 
Ligand/ion 
Water 

R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 


P22) 21 


92.98, 150.06, 400.55 
90, 90, 90 

50.0-2.45 (2.49-2.45) 
0.169 (1.288) 

0.050 (0.531) 

12.9 (1.1) 

99.7 (97.2) 

12.1 (6.5) 

0.997 (0.496) 


50.0-2.45 
205,381 
0.2058 / 0.2327 


27896 
1 (Zn) 
1051 


53.60 
51.90 
45.90 


0.013 
0.840 


*One crystal was used for each structure. 


*Values in parentheses are for highest-resolution shell. 
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P22) 21 


92.81, 149.83, 404.10 
90, 90, 90 

50.0-3.20 (3.26-3.21)* 
0.265 (1.066) 

0.143 (0.667) 
581-5) 

96.8 (84.1) 

3.6 (3.0) 

0.967 (0.422) 


50.0-3.21 
90,215 
0.2087 / 0.2470 


27237 
1 (Zn) 
0 


62.10 
61.50 
N/A 


0.018 
1.010 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16074 


Corrigendum: NLRP10 is a NOD -like 
receptor essential to initiate adaptive 
immunity by dendritic cells 

Stephanie C. Eisenbarth, Adam Williams, Oscar R. Colegio, 
Hailong Meng, Till Strowig, Anthony Rongvaux, 

Jorge Henao-Mejia, Christoph A. Thaiss, Sophie Joly, 

David G. Gonzalez, Lan Xu, Lauren A. Zenewicz, 


Ann M. Haberman, Eran Elinav, Steven H. Kleinstein, 
Fayyaz S. Sutterwala & Richard A. Flavell 


Nature 484, 510-513 (2012); doi:10.1038/nature11012 


In this Letter, we reported that NLRP10-deficient mice had no defect 
in inflammasome function in macrophages or dendritic cells (DCs). 
Instead, a loss of T-cell-dependent immune responses was seen in these 
mice secondary to a defect in DC migration. We have since noticed a 
change in the phenotype of the NLRP10-knockout mice involving DC 
migration, after backcrossing them onto backgrounds such as FVB or 
BALB/c (see Supplementary Methods). We used whole-exome sequenc- 
ing of these mice to determine whether loss ofa different gene accounted 
for the observed phenotype. Six homozygous mutations/indels were 
found (sequence deposited in NCBI Sequence Read Archive under 
accession SRR1792904); one was a homozygous point mutation in the 
Dock8 gene (Fig. 1a). DOCK8 is a guanine nucleotide exchange factor 
that has been shown to regulate the GTPase CDC42 in mouse DCs, 
and biallelic mutations in humans cause a severe immunodeficiency 
syndrome (OMIM 611432)”. This point mutation results in a pre- 
mature stop codon in the second to last exon, and we believe results in 
nonsense-mediated decay of Dock8 messenger RNA (Fig. 1b). Dock8 
mRNA is significantly reduced in DCs from the original Nirp10~/~ 
strain, but is restored when the mice are bred onto other wild-type back- 
grounds (Fig. 1c). The Bruce4 embryonic stem cells used to generate 
the NLRP10-deficient mice do not contain the mutation after sequenc- 
ing (see Fig. 1a for primers and nucleotide change). It may have arisen 
spontaneously as a natural mutation early during the intercrossing of 
the knockout mice, and became fixed in our colony before the original 
phenotype analysis. From a previous proteomic screen, we identified 
DOCK8 as a candidate molecule involved in the loss of coordinated 
DC movement*. We tested whether this point mutation accounted for 
the loss of DC migration in vivo and adaptive immune responses. When 
mice were bred to isolate the NLRP10 deficiency, we found that DC 


5’UTR ATG Dock8 gene Exon 47 Stop 3’UTR 
WT 


migration was independent of NLRP10 (Fig. 1d). However, some aspects 
of the phenotype originally reported were maintained. For example, iso- 
lated NLRP10 loss does not result in enhanced NLRP3 inflammasome 
activation®. In addition, aberrant Gdpd3 upregulation was due to Nirp10 
loss and was not affected by the Dock8 gene’. Notably, spontaneous Dock 
gene loss of function has been reported in many unrelated knockout 
mouse strains. For example, Dock2 loss was recently reported in ASC- 
knockout mice*. In addition, Dock2 mutations have been found in a 
subset of IRF5-deficient strains®. These separate instances suggest that 
the Dock gene family is prone to mutation, whether because of the large 
size (average of 275 kb across the 11 family members) or something more 
specific to these genes themselves, and that these mutations are readily 
discovered owing to the induction of a profound phenotype. Regardless of 
the aetiology of the mutations, these cases highlight the need for contin- 
ual evaluation of genetic stability of inbred mouse strains, especially those 
expanded by sibling matings, as is common practice in many animal 
facilities. In addition, to limit accumulation of mutations in inbred strains 
as seen in our mice from the exome sequencing, periodically obtaining 
new breeding stock from founder lines or cryopreserved embryos from 
early generations has been recommended’. The original mouse pheno- 
type reported in our Letter is correct and highly reproducible, but the 
conclusions must be reinterpreted in light of this new finding. We con- 
clude that (1) DC migration is dependent on DOCK8 and not NLRP10; 
(2) NLRP10 deficiency is associated with aberrantly upregulated GDPD3; 
and (3) NLRP10 deletion does not alter NLRP3 inflammasome activity. 
We regret any confusion that this may have caused. 
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Supplementary Information is available in the online version of the Corrigendum. 
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Figure 1 | NLRP10-deficient mice contain a Dock8 point mutation 
resulting in failed DC migration. a, Dock8 gene structure, highlighting 
homozygous point mutation in exon 47 in NLRP10-knockout mice. 

b, Resulting coding changes in DOCK8 induced by point mutation in 

a. c, Unstimulated bone-marrow-derived DCs from the original 
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NLRP10-deficient mice with a Dock8 mutation had significantly 

reduced Dock8 mRNA. When the Dock8 mutation was corrected through 
breeding, DCs from NLRP10-deficient mice showed no alteration in 
Dock8 mRNA (relative to Hprt). d, In vivo DC migration to inguinal lymph 
nodes (see Supplementary Methods). 
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Frustrated 
postdocs rise up 


US postdoctoral researchers struggle with pay and career 
prospects. But some groups are fighting back. 


BY PAUL SMAGLIK 


he tribulations of US postdoctoral 
[Fess — low stipends, spotty 

benefits, uneven training and hazy 
career prospects — have received plenty of 
airtime in the past few years. But efforts to 
combat these woes have been achingly slow. 
Frustrated that recommendations by univer- 
sities, policymakers and other stakeholders 
are not quickly bearing fruit, grass-roots and 
advocacy groups are trying to address some of 
the problems on their own. 


High-level reports have noted the problems 
for more than a decade. The US National Acad- 
emy of Sciences (NAS) outlined the hardships 
faced by science and engineering postdocs in 
2000, and its 2014 follow-up report noted that 
the issues persist and have begun to bleed into 
other disciplines. A study in the Proceedings 
of the National Academy of Sciences (PNAS) 
that same year found that the US research 
enterprise prepares most of its postdocs and 
graduate students for tenure-track positions 
that only 20% of them are likely to secure 
(B. Alberts et al. Proc. Natl Acad. Sci. USA 111, 


5773-5777; 2014). The reports have called for 
measures such as higher stipends, set training 
periods and broader career training. 

“There’s a frustration with constantly talking 
about it and constantly writing about it,’ says 
Gary McDowell, a developmental-biology 
postdoc at Tufts University in Boston, Massa- 
chusetts, and a member of Future of Research 
(FOR), a Boston-based group that is working 
to improve postdoc pay and benefits. Other 
organizations are fighting their own battles. A 
postdoc group at Stanford University in Cali- 
fornia helped to push for higher stipends there. 
Another group at the University of California, 
San Francisco (UCSF), expanded an intern- 
ship programme to allow postdocs in addi- 
tion to graduate students. And a New York 
University (NYU) group launched an effort to 
battle isolation among its postdocs, who are 
scattered throughout the metropolitan region, 
and provide them with professional network- 
ing and collaboration opportunities. 

Postdocs stand to gain much by becoming 
active in such local and regional organized 
groups — including developing skills that 
may help with their career (see ‘Activism pays 
off’). Change can happen much more quickly 
at local levels. And postdocs don’t have to wait 
for a national level, universal solution that may 
never materialize. “I feel that there’s a lot more 
hope,’ says McDowell. 

Money, for many postdocs, is the number- 
one issue. The Stanford University Post- 
doctoral Association (SURPAS) has already 
made independent strides in this arena. 
Last year, it negotiated for higher stipends 
and, in October, it won a minimum level of 
US$50,000 a year for all 2,000 Stanford post- 
docs, regardless of their funding source, and a 
higher rate for those with more than 3 years in 
the position. That rate is 17% higher than the 
US National Research Service Awards mini- 
mum of $42,840. 

SURPAS has also helped Stanford postdocs 
to defray their commuting expenses. Faculty 
members and staff have historically received 
free rail commuter passes, but Stanford post- 
docs had not been entitled to that benefit. 
In 2012 and 2013 polls, SURPAS found that 
many were upset about commuting outlays. 
With support from faculty and staff allies, it 
secured a pilot programme in 2014 to extend 
the free rail pass to postdocs: Stanford post- 
docs previously spent up to $2,100 a year on 
train commutes; now they spend $190. Stan- 
ford administrative offices are evaluating the 
pilot programme for renewal this year. > 
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> Other organizations focus on training 
that prepares postdocs for jobs beyond aca- 
demia, because reports continue to document 
the dearth of jobs in the sector. Postdocs need 
time away from the bench to learn career- 
related skills, but they often can’t get it, even 
when their host universities offer such pro- 
grammes for graduate students. Such was 
the case at UCSE, where postdocs had been 
ineligible for career-exploration internships. 
P-Value, a UCSF-based postdoc-advocacy 


group that formed in 2014, changed that. 
After a forum last March to discuss career 
training, members united to request that UCSF 
support short, flexible internships for post- 
docs in fields such as intellectual property and 
technology transfer. The university admin- 
istration agreed to sponsor campus-based 
internships and created a pilot that launched 
last September and has drawn half a dozen 
participants so far. One of them, Jessica Lao, 
teaches a biochemistry class at San Francisco 


ACTIVISM PAYS OFF 


Postdocs who help colleagues also help themselves 


Postdocs are drawn to grass-roots activism 
to create better training conditions for 
themselves and their peers. In doing 

so, they can pick up and polish skills 

such as negotiation, time management, 
communication and leadership that aid 
their own career development. 

Amita Bansal, a postdoc at the 
University of Pennsylvania in Philadelphia 
and member of the Penn Biomedical 
Postdoctoral Council, learned all of those 
skills when she developed a programme 
to help foreign national postdocs integrate 
into the Penn postdoc community. To 
get ideas for events, she reached out 
to fellow postdocs who were foreign 
nationals. And she had to negotiate with 
university leadership to get approval for the 
programme, which, she says, can draw up 
to 100 attendees depending on the type of 
session or topic. “I’ve learned how academic 
structure and administrative hierarchy 
work,” she says. 

The planning aspect of the programme — 
inviting and setting up speakers, arranging 
sponsorship and securing venues — has 
forced her to hone her organizational skills. 
Although sessions take place after working 
hours, she sets aside an hour or so daily for 
tasks that must be done during the daytime 
and spends time in the evening and on 
weekends on others, such as e-mails and 
text messages. 

“Time management is extremely crucial,” 
she says. 

That same skill has also played a major 
role in Rodoniki Athanasiadou’s advocacy 
efforts. The New York University postdoc 
formed a regional branch of the postdoc 
corps Future of Research (FOR) and 
co-organized and presented a day-long 
seminar for the branch last year. Like 
Bansal, she has learned to fit in e-mails and 
phone calls around her benchwork. 

Advocacy efforts can teach postdocs 
to tailor their message. Jessica Polka, a 
postdoc at Harvard Medical School in 
Boston, Massachusetts, is active in the 


Jessica Polka leads a FOR meeting in Boston. 


Boston-area FOR group and is a member of 
a national-level steering committee that is 
gathering information on postdoc problems 
and employment. In these roles she works 
with university administration, heads and 
members of postdoc organizations and 
national policy leaders, and has learned 

to adapt her messages to appeal to each. 
“Speaking about topics other than science 
has helped me communicate more 
effectively,’ Polka says. 

J.T. Neal, a postdoc at Stanford University 
School of Medicine in California and 
a member of the Stanford University 
Postdoctoral Association, has noticed a 
similar benefit from his work to negotiate 
postdoc stipends with the university. 

“The whole process taught me about 
identifying each party’s points of conflict 
and working collaboratively to address 
them,’ he says. “And | learned to be patient, 
as the pace of change in institutions of 
higher learning can be glacial.” 

For both Polka and her fellow FOR 
member Gerry McDowell, a postdoc at Tufts 
University in Boston, their advocacy efforts 
have also opened potential career paths 
beyond research. “| am now more aware 
of the super-important work in policy and 
communications,’ Polka says. “I would be 
very happy in either of these areas.” PS. 
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State University under the programme. A post- 
doc at UCSF’s medical school, she decided to 
explore a career in teaching and wanted class- 
room experience. The course requires up to 
six hours a week of lecture and office time, 
and more for preparation, but her lab head 
has been supportive of her need for time away 
from the bench, she says. “He understands that 
it’s essential” 

Katherine Thompson-Peer, co-chair of 
P-Value, attributes the success of the pilot to 
building it on an existing scheme, rather than 
trying to create a new one. She is optimistic 
that UCSF will formalize the programme when 
the pilot period ends. 

These regional efforts are spreading nation- 
ally as local organizations seek to link with and 
support their counterparts elsewhere. SURPAS 
has created a postdoc advocacy plan for adap- 
tation by other groups. FOR has extended into 
New York and Chicago, Illinois. Along with 
providing mutual advocacy, this expansion 
can also boost potential research connections. 
FOR New York's regular meetings and other 
events, for example, have helped members 
from different disciplines to form bonds and 
identify common research interests. The group 
organized and presented an interdisciplinary 
research symposium last year that included 
sessions on career issues and research policy. 
“This is quite an achievement,’ says FOR mem- 
ber Rodoniki Athanasiadou. “Historically, 
postdocs in different departments at NYU 
have not interacted scientifically: 

Postdoc-advocacy groups have also 
triumphed together in another battle. The 
paucity of concrete data about employment 
trends for post-postdocs has plagued trainees 
for decades and stymies their career-planning 
efforts. Some groups are gathering their own 
data by tracking alumni. 

At a national level, postdoc groups have 
been pressuring the US National Science 
Foundation to collect these data. Those efforts 
led last year to the creation of a website by the 
authors of the PNAS and NAS reports that aims 
to collect and organize such data. The web page 
also hosts suggestions to improve postdocs’ 
training and funding, among other issues. 
Jessica Polka, a postdoc at Harvard Medical 
School in Boston, Massachusetts, says that 
the site provides a ‘bottom-up’ way for groups 
such as FOR to directly address top biomedical 
policymakers. She isa member of FOR Boston 
and of the site’s steering committee. 

Thompson-Peer, for one, says that she is 
delighted to have the opportunity to contrib- 
ute to the national dialogue. The voices of 
postdoc organizations are growing stronger, 
she says, and postdocs have far more leverage 
as members of a collective group than they do 
individually. “We cannot make these structural 
changes on our own,’ she says. = 


Paul Smaglik is a freelance writer in 
Milwaukee, Wisconsin. 
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Changes on the ice 


Diverse faces are appearing on a frozen continent, says Robin Bell. 


r Yo measure gravity from the sky, you 
need a calm day and a steady aircraft. 
Often, the instruments involved do 

not need much tending to, even when flying 
at low altitude over the frozen Weddell Sea off 
the Antarctic Peninsula. As a graduate student 
in 1986, I was lucky: my experiments mostly 
worked well. In moments of peace, I could sneak 
to a window in the aircraft's belly. 

As my equipment collected data that would 
reveal the hidden sea bed, I would spend hours 
watching the sea below. My fellow scientists 
and I were far from any place that a wheeled US 
Navy P-3 Orion aircraft could safely land. The 
polar ice, floating and flowing, was beautiful. 
Leopard seals looked like slugs from 150 metres 
up. They gathered around the large tabular ice- 
bergs where our pilots planned to ditch if our 
plane lost an engine. 

During these expeditions, I avoided stories 
of polar explorers such as Shackleton, Scott 
and Amundsen. It seemed impossible to relate 
to these all-male teams. The other graduate 
student on the trip was also a woman, geophysi- 
cist Carol Raymond. She now co-leads NASAs 
Dawn mission, which is orbiting the proto- 
planet Ceres. We wore paratrooper boots and 
helmets like the rest of the team, and were still 
mistaken for secretaries. Both of us experienced 
what would today be considered harassment. 
As students, we dared not 


utter a word. Such behav- “We wore 
iour by male colleagues paratrooper 
was considered part of the boots and 
job, or the woman's fault. helmets, and 
Role models were few were still 
and far between back mistaken for 


then. There were no _ secretaries.” 
female Earth-science 

faculty at Columbia University in New York 
where I did my PhD and work today. Marine 
geologist Marie Tharp, who was the first to 
map the mid-ocean ridge, would occasionally 
appear on campus. Deep in mourning for her 
life partner and collaborator, who had died 
in a submersible at the bottom of the Atlantic 
Ocean, she had little time for students such as 
Carol and me. Outside Columbia, we looked 
to geophysicist Marcia McNutt, who was said 
to file her nails on the fantails of research ships 
before tossing dynamite overboard to blast 
through the ice. 

But just as geophysicists’ instruments have 
captured changes in the ice sheets, we are seeing 
changes in the scientific establishment. Women 
now routinely lead expeditions. Institutions take 


Geologist Christine Siddoway maps faults in Marie Byrd Land, West Antarctica. 


harassment seriously. The rising generation of 
scientists is unwilling to accept the hostility that 
drove many of my friends from science. Female 
scientists are speaking up; they are being heard, 
and they are getting work done. 

These days, when I look out of the aircraft 
that carries an imaging system built by my 
research group, I still see ice — flowing, fold- 
ing and rotating in its many beautiful forms. 
But if I close my eyes, I do not picture the frost- 
encrusted beards ofan all-male expedition. 


FEMALE FACES 

I see palaeoclimatologist Dorthe Dahl-Jensen 
on the Greenland ice sheet, sheltering in a 
tiny tent while running a winch that drags 
an instrument through a borehole. She has 
drilled ice cores in Greenland and Antarc- 
tica to assemble spectacular records of the 
changing climate. 

I see geologist Christine Siddoway, who is 
trying to figure out how the Antarctic continent 
tore off from New Zealand. She once spent a 
sleepless night in her camp as hurricane-force 
winds lifted snowmobiles into the air and 
shredded tents. 

I see microbiologist Jill Mikucki, with a 
satellite phone glued to her ear, leading an inter- 
national team to study ecosystems under the ice. 

And I see so many more: glaciologist 
Helen Fricker, installing Global Positioning 


System base stations on the Amery Ice Shelf 
in Antarctica; Indrani Das, who was drawn 
by the beauty of snow to leave astrophysics; 
and Adrienne Block, whose PhD dissertation 
took her to a remote camp in East Antarctica. 
It brings me great joy to see so many women 
among the ranks of polar scientists. 

Last November, my team was deployed 
to Antarctica to launch a programme called 
ROSETTA-Ice. We used gravity to map the 
Ross Ice Shelf and to uncover its tectonics, an 
idea that Siddoway and I thought of more than 
a decade ago. The fieldwork was led by Kirsty 
Tinto; I met her eight years ago at McMurdo 
Station, the main US base in Antarctica. I had 
been lost in logistics — how to get my team 
safely atop the ice sheet, along with the fuel — 
when Tinto walked down the hall that serves 
as the main street of that small science town. 
The young geophysicist had just returned from 
weeks on the ice. Her laughter and confidence 
caught my eye: she fit my idea of what a scientist 
should look like. 

Now, I don't have to shut my eyes to see 
scientists in all shapes, sizes, nationalities and 
genders. It is this rich array that buoys meas I 
watch the ice sheet shift. m 


Robin Bell is a research professor at the 
Lamont-Doherty Earth Observatory of 
Columbia University in Palisades, New York. 


25 FEBRUARY 2016 | VOL 530 | NATURE | 507 


© 2016 Macmillan Publishers Limited. All rights reserved 


Ua SCIENCE FICTION 


DU 


K, DUCK, DUCK 


BY SAMANTHA MURRAY 


¢C C uck, duck, duck,” says Maddy, 

D touching each head in the circle 

with a perfunctory authority. 

“Duck, duck,’ she pauses almost impercep- 

tibly at the curly blonde head in front of her, 
“alien” 

The owner of the curls, Rebecca, 
scrambles to her feet as Maddy 
tears off round the circle. 

“Go Maddy, go Maddy, go 
Maddy,” the other children 
screech, their voices getting 
higher and more feverish as 
Maddy passes the empty 
space and starts on her 
second lap around. 
No one cheers for 
the little blonde 
girl. No one wants 
the alien to win. 

“Go Maddy,’ yells 
Candice, along with the 
others, feeling the tightness in 
her chest, her hands making little 
crescent moon marks in her palms. 
The alien is gaining on Maddy, despite 
her head-start. Its arms look longer than 
they should stretching out, stretching out... 
as Maddy skids into the vacant spot and fills 
it with her warm panting body. The circle is 
complete again. 

Rebecca stops. Her face is red and there is 
alittle bit of spit at the corner of her mouth. 
Then she smiles and starts walking, a slow 
measured gait. 

“Duck, she says. “Duck, duck, duck” 

The teachers don't like this game, 
Candice knows, but they don’t cross over the 
expanse of bitumen to the grassed area that 
skirts the oval where the kids play it at recess 
time. Everyone plays it now, the climbing 
frames are desolate in the sun and the skip- 
ping ropes and footballs are left in the sports 
cupboards. 

The teachers dont like the game, prob- 
ably because they dontt like to talk about the 
aliens at all. Candice’s mother doesn’t talk 
about them either, her eyes went all shifty 
and she gota worried crease in her forehead 
the one and only time Candice brought it up. 
She switches off the news when Candice is 

around, but Candice 


> NATURE.COM knows anyway. The 
Follow Futures: kids all know. 

Y @NatureFutures The first sign of 
Ei go.naturecom/mtoodm © turning into an alien 


It’s no game. 


is that your eyes start to look like glass, all 
reflective and shiny and hard. And you are 
cold, cold, cold, because something differ- 
ent is happening to your blood. The last, and 
final, sign is that you start to go transparent, 
so that you can see your veins and bones and 
intestines and gross things through your 


skin. Candice doesn’t know what happens 
after that, she knows that they quarantine 
you and stick you with needles and prob- 
ably try to get you to tell them where the 
other aliens are and what plans they have 
for invading Earth. 

Candice looks towards the teacher on 
duty, standing near the classroom under the 
shade of a large box tree, every green leaf 
glinting. Candice doesn't wear a watch, but 
all of the kids know by some kind of internal 
instinct that recess time is nearly over. The 
game has picked up an added urgency. A 
dark-haired girl from the grade below Can- 
dice trips on her way round the circle and 
only just avoids the alien. She looks like she 
is going to cry. 

When the siren goes all of the kids will 
scatter like ash to the wind, helter-skelter- 
ing back to the school buildings. If you are 
stuck being the alien at that point you will 
have to be the alien all day. No one will want 
to sit next to the alien, or pass notes to it, 
or walk home with it. Even the teachers 
can somehow detect the stench of alien 
and will respond to it with a distaste that is 
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almost, almost, but not quite, concealed. 

Candice knows that the aliens have an 
“R nought of exactly one. She doesn’t know 
where she heard that, but she knows what 
it means. It means an alien will always pass 
it on to just one person. An alien might 
have contact with many people, but it will 
only go “duck” and “duck” and “duck”, until 
it finds the one person who is exactly and 
only right. 

You didn’t show that you were an alien 
immediately. There would be a time 
when you were still yourself but 
you were really an alien but you 
didn’t know it. Maybe you just 
felt colder inside, cold, cold 
like her Dad’s hands when 

hed clutched her when he 
picked her up on Friday 
because it was his weekend 
with her. 
Hed been cold and his 
eyes had been glinty and 
distracted all weekend, 
and when hed dropped her 
at school Candice had felt all 
sad and lost although she didn't 
usually, usually it was fine. 
Maybe you would feel cold and 
maybe everything would look sharper 
like the leaves on the box tree even from so 
far away, and maybe your blood wouldnt go 
in hot spurts anymore, but instead be cool 
and steady like cooled lava — dark and intri- 
cate, swirled and ropey. 

“Duck,” says Gracie, who used to be 
Candice’s best friend last year. 

Candice looks back to see the teacher 
looking at her wrist. She can see the little 
lines cross-hatched in the skin below her 
eyes, and the beads of sweat to the side of 
her nose, even though the teacher is all the 
way over near the classroom. 

“Duck,” says Gracie, her footsteps 
approaching, and were they slowing, just a 
little? 

I don't want to be the alien, thinks Can- 
dice. I don’t want to be the alien, I don’t want 
to be the alien. 

“Duck,” says Gracie. = 


Samantha Murray is a writer, actor, 
mathematician and mother. Not 
particularly in that order. Her fiction has 
been seen in Lightspeed (Women Destroy 
Science Fiction!), Flash Fiction Online, 
Daily Science Fiction and Writers of the 
Future, Vol. 31, among others. 
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